(54) Bacillus stearothermophilus DNA polymerase I (Klenow) clones including those with
reduced 3'-to-5' exonuclease activity



(57) Disclosed and claimed are isolated nucleic acid molecules encoding Bacillus stearothermophilus DNA
polymerase (DNApolI), including the structural gene for DNApolI, such as DNApolI genes having insertions,
deletions, inactivation, or mutations at the 5' end thereof which thus encode Bst polymerase I enzymes which
lack or have reduced 3'-5' exonuclease activity, as well as methods for making and using such nucleic acid
molecules and such polymerases. For instance, the nucleic acid molecules are useful for making the
polymerases, for example, by expression of a vector comprising the nucleic acid molecules; and, the
polymerases are useful in DNA sequencing and/or labelling. Thus, disclosed and claimed are recombinant DNA
clones corresponding to the Bacillus stearothermophilus DNA polymerase (DNApolI) structural gene with
deletions at the 5' end. The polymerases from these recombinant DNA clones exhibit DNA synthesis domain
activity but have reduced 3' to 5' exonuclease activity. Further, minor modifications at the 5' and 3' ends
allow the clones to be manipulated by cloning and expressed as monomeric peptides. These deleted DNA clones
give rise to truncated DNApolIK enzymes that are deficient in 3' to 5' exonuclease activity and are useful in
nucleic acid synthesis by primer extension reactions, particularly DNA labelling and DNA sequencing
reactions. Furthermore, these clones may be combined with other clones in heterologous constructs to create
hybrid proteins.
Description

BACILLUS STEAROTHERMOPHILUS DNA POLYMERASE I (KLENOW) CLONES INCLUDING THOSE WITH REDUCED 3'-TO-5'
EXONUCLEASE ACTIVITY

CROSS REFERENCE TO RELATED APPLICATIONS

This application claims priority from Singapore Application No. 9701158-9, filed April 10, 1997, incorporated herein
by reference.

FIELD OF THE INVENTION 

The present invention relates: to a genomic clone of Bacillus stearothermophilus, such as a gene or isolated nucleic
acid molecule encoding a DNA polymerase from Bacillus stearothermophilus; to Klenow-like fragments therefrom; to
expression products thereof; to uses thereof, e.g., in manual or automated DNA sequencing and/or labeling; to isolated
nucleic acid molecules or genes from Bacillus stearothermophilus, e.g., from the Bacillus stearothermophilus ("Bst")
DNA polymerase (DNApolI) gene, having deletions therein, e.g., from the 5' end; to expression products thereof, e.g.,
which are 3' to 5' exonuclease-deficient or which have reduced 3' to 5' exonuclease activity; and to uses thereof,
e.g., in manual or automated DNA sequencing and/or labeling.

Thus, the invention also pertains to Bst DNA polymerases which have reduced 3' to 5' exonuclease activity or
which are deficient in 3' to 5' exonuclease activity, as well as to uses thereof in manual or automated DNA sequencing
and/or labeling; and, the invention pertains to isolated nucleic acid molecules encoding Bst polymerases having
reduced 3' to 5' exonuclease activity (e.g., portions of the Bst DNApolI gene).

The isolated nucleic acid molecules or genes from Bst DNApolI which are truncated at the 5' end (in comparison
to full length Bst DNApolI) are also herein called "deleted clones" and give rise to truncated B. stearothermophilus
DNA polymerase (DNApolI) enzymes that are deficient in 3' to 5' exonuclease activity or have reduced 3' to 5'
exonuclease activity and are thus useful in nucleic acid synthesis by primer extension reactions, particularly DNA
sequencing and DNA labelling reactions. These deleted clones may be combined with other isolated nucleic acid
molecules, e.g., other deleted clones, to create hybrid proteins with novel properties.

Accordingly, the invention also relates to vectors containing and/or expressing the isolated nucleic acid molecules, 
and to methods for expressing the isolated nucleic acid molecules, including isolating the polymerase products from 
such expression; and, to methods for sequencing or labeling DNA including such polymerase products. 

Several documents are cited in the following text, with either full citation where the document is cited, or with full 
citation appearing in a Document List prior to the claims. These documents relate to the state of the art to which this 
invention pertains, and each and every document cited in the following text, as well as all documents cited in each and 
every document cited in the following text, are hereby incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

DNA polymerases synthesize DNA molecules in the 5' to 3' direction from deoxynucleoside triphosphates (nucle- 
otides) using a complementary template DNA strand and a primer by successively adding nucleotides to the free 3'- 
hydroxyl group of the growing strand. The template strand determines the order of addition of nucleotides via Watson- 
Crick base pairing. In cells, DNA polymerases are involved in DNA repair synthesis and replication (Kornberg, 1974, 
In DNA Synthesis. W.H. Freeman, San Francisco). 
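
By way of illustration only, the template-directed, 5' to 3' addition of nucleotides described above can be sketched in a few lines of Python. The function name, the convention of writing the template 3' to 5', and the toy sequences are assumptions made for this sketch; they do not come from the cited literature.

```python
# Watson-Crick partners for the four DNA bases.
PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


def extend_primer(template_3to5: str, primer_5to3: str) -> str:
    """Extend a primer annealed at the 3' end of a template.

    The template is written 3'->5' so that position i of the template pairs
    with position i of the growing strand, which is written 5'->3'.
    """
    # The primer must already be base-paired with the start of the template.
    for t, p in zip(template_3to5, primer_5to3):
        if PAIR[t] != p:
            raise ValueError("primer is not complementary to the template")
    strand = list(primer_5to3)
    # Successively add the complementary nucleotide to the free 3'-hydroxyl end.
    for t in template_3to5[len(primer_5to3):]:
        strand.append(PAIR[t])
    return "".join(strand)


# Toy example: template 3'-TACGGTCA-5' with primer 5'-ATG-3'.
product = extend_primer("TACGGTCA", "ATG")
```

The order of the added bases is fixed entirely by the template, which is the point made in the paragraph above.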

Escherichia coli DNA polymerase I (DNApolI) and other homologous polymerases have three enzymatic functions:
i) a 5' to 3' exonuclease activity, ii) a 3' to 5' exonuclease activity and iii) a DNA synthesis activity. The latter
two functions are located towards the COOH-end of the protein, within a 'Klenow' fragment (DNApolIK). Enzyme
preparations of E. coli DNA pol I can be treated with subtilisin to yield an E. coli DNApolIK lacking the 5' to 3'
exonuclease activity (see Brown et al., 1982, J. Biol. Chem. 257: 1965-1972; Joyce et al., 1982, J. Biol. Chem. 257:
1958-1964; Joyce et al., 1983, Proc. Natl. Acad. Sci. USA 80: 1830-1834; Klenow & Henningsen, 1970, Proc. Natl. Acad.
Sci. USA 65: 168-175; Kornberg, 1974; Setlow, P. et al., 1972, J. Biol. Chem. 247: 224-231; Setlow and Kornberg,
1972, J. Biol. Chem. 247: 232-240; Steitz and Joyce, 1987, In Protein Engineering, Chap. 20, pp. 227-235. Oxender,
D.A., and Fox, C.F. (eds). Alan R. Liss, New York). It has been shown that the DNApolI gene is unstable, although the
DNApolIK gene has been cloned and is stable (Joyce et al., 1982, J. Biol. Chem. 257:1958-64; Joyce et al., 1983,
PNAS USA 80:1830-34).

Research on E. coli DNApolIK has indicated that the sites of the 3' to 5' exonuclease and DNA synthesis activities
may be separated as their corresponding substrates do not compete. The DNA synthesis active site binds to
double-stranded DNA containing a single-stranded 5' extension and deoxynucleoside triphosphate (dNTP) whereas the 3' to
5' exonuclease active site binds to deoxynucleoside monophosphate (dNMP) (Ollis et al., 1985, Nature 313: 762-766).
The existence of a conserved 3' to 5' exonuclease active site present in a number of DNA polymerases was predicted
by Bernad et al., 1989, Cell 59: 219-228; Blanco et al., 1992, Gene 112: 139-144; Reha-Krantz, L.J., 1992, Gene 112:
133-137.

Crystal structure analysis of E. coli DNApolIK has shown that its peptide chain is folded into two distinct domains,
with the smaller domain of 200 amino acid residues being the 3' to 5' exonuclease domain and the other domain of
400 amino acid residues being the DNA synthesis domain (Ollis, 1985).

Evidence using modified DNA substrates further supports the hypothesis that the 3' to 5' exonuclease domain is
separate from the DNA synthesis domain (Cowart et al., 1988, Biochem. 20: 1973-1983). This view is also supported
by base mutations and deletions in related T4 and T5 DNA polymerases (Frey et al., 1992, Proc. Natl. Acad. Sci. USA
90: 2579-2583; Leavitt et al., 1989, Proc. Natl. Acad. Sci. USA 86: 4465-4469; Spacciapoli, P. and Nossal, N.G., 1994,
J. Biol. Chem. 269: 438-446).

Both the 3' to 5' exonuclease and the DNA synthesis domains of E. coli DNA polymerase I have been cloned,
expressed, and characterized independently. The 3' to 5' exonuclease domain consists of approximately 200 amino
acids. The DNA synthesis domain contains approximately 400 amino acids. There is, however, 50-fold less activity in
the DNA synthesis domain when compared to the E. coli DNApolIK (Morrison et al., 1991, Proc. Natl. Acad. Sci. USA,
88:9473-9477).

Bacillus stearothermophilus is a thermophilic bacterium. The DNApolI of Bacillus stearothermophilus displays
optimum activity at 67°C. Its amino acid sequence shows a high level of homology to that of E. coli polI. Other
similarities include the presence of three active domains: 5' to 3' exonuclease, 3' to 5' exonuclease, and DNA
synthesis (Kaboev et al., 1981, J. Bacteriol. 145: 21-26; Phang et al., 1995, Gene 163: 65-68; Stenesh and Roe, 1972,
Biochim. Biophys. Acta 272: 156-166; Ye and Hong, 1987, Scientia Sinica 30: 503-506). Limited proteolysis with
subtilisin also results in a small fragment with the 5' to 3' exonuclease activity and a large Klenow-like fragment
(Lu et al., 1991, BioTechniques 11: 465-466; McClary et al., 1991, DNA Sequence 1: 173-180; Phang et al., 1995). The
B. stearothermophilus DNApolI large Klenow-like fragment has been proposed to contain a domain for the 3' to 5'
exonuclease activity (Ye & Hong, 1987).

Many molecular cloning techniques and protocols involve the synthesis of DNA in in vitro reactions catalyzed by
DNA polymerases. For example, DNA polymerases are used in DNA labelling and DNA sequencing reactions, using
either 35S-, 32P- or 33P-labelled nucleotides. Most of these enzymes require a template and primer, and synthesize
a product whose sequence is complementary to that of the template. The 5' to 3' exonuclease activity of E. coli DNA
polymerase I is often troublesome in these reactions because it degrades the 5' terminus of primers that are bound to
DNA templates and removes 5' phosphates from the termini of DNA fragments that are to be used as substrates for
ligation. The use of a DNA polymerase for these labelling and sequencing reactions thus depends upon the removal
of the 5' to 3' exonuclease activity.

The 5' to 3' exonuclease activity can be removed proteolytically from the holoenzyme without affecting either the
polymerase activity or the 3' to 5' exonuclease activity. The Klenow fragment of E. coli DNA polymerase I that is
available today from commercial sources consists of a single polypeptide chain produced by cleavage of intact DNA
polymerase I with subtilisin or by cloning. This creates a DNApolIK containing only the 3' to 5' exonuclease and DNA
synthesis activities (without the 5' to 3' exonuclease activity) and allows for net DNA processivity.

DNA processivity is performed by heat denaturation of a DNA template containing the target sequence, annealing
of a primer to the DNA strand and extension of the annealed primer with a DNA polymerase. At low temperatures of
about 37°C, the DNA may be insufficiently denatured and secondary structures may impede DNA processivity. This
results in stoppage of primer extension (insufficient labelling in DNA labelling) or errors (e.g., band compression or
cross-banding artifacts in DNA sequencing). Band compressions are artifacts seen on sequencing gels where the
template contains a run of G-C rich sequence. They may be overcome by adding deaza- or inosine-nucleotides to
substitute for guanosine nucleotides or, preferably, by performing DNA processivity at elevated temperatures.
Cross-banding artifacts are caused by premature termination, mismatches, or both.

The incorporation of a thermostable subtilisin-treated DNA polymerase, such as that from Bacillus
stearothermophilus DNApolI, into DNA sequencing reactions allows for these secondary structure artifacts to be
overcome. B. stearothermophilus DNApolI has a temperature optimum of 67°C and exhibits an ideal banding pattern with
minimal or no crossbanding, due to the presence of a 3' to 5' exonuclease activity in the enzyme. Crossbanding
produces unreadable sequencing gels.

The subtilisin-digested B. stearothermophilus DNA polymerase I (DNApolI) is distinguished from other DNA
polymerases by its high degree of DNA processivity and fidelity (Lu et al., 1991, BioTechniques 11: 465-466; McClary
et al., 1991, DNA Sequence 1: 173-180; Mead et al., 1991, BioTechniques 11: 76-87; Ye & Hong, 1987, Sci. Sin. 30:
503-506). This is exemplified by readable DNA sequences on X-ray films of at least 350 bases with little or no
cross-banding. In contrast, other DNA polymerases may have a high degree of processivity but with poor fidelity,
resulting in errors or cross-banding artifacts.

The high degree of polymerase activity is due to the DNA synthesis domain. The high fidelity conferred upon B.
stearothermophilus DNApolI may be due to its proposed 3'-5' exonuclease activity (proofreading or editing function),
which is proposed to reside independently of the DNA synthesis activity in a similar fashion to other Type I DNA
polymerases as mentioned earlier; but Bst DNApolI may lack functional 3'-5' exonuclease activity (see Stellmann et
al., infra).

Alternatively, high fidelity may not be due to the 3'-5' exonuclease activity, but rather, may be an inherent property
of the DNA synthesis domain of B. stearothermophilus DNApolI; e.g., tighter binding of the enzyme to the DNA template
may confer higher fidelity by preventing mismatches from occurring.

The concept of net DNA processivity, however, is the ratio of DNA synthesis activity versus 3'-5' exonuclease
activity. DNA synthesis enzyme (synthetase) acts to polymerize nucleotides while 3'-5' exonuclease has an editing or
proof-reading function. Thus high DNA synthesis is generally achieved at the expense of high fidelity and vice versa.
The 3'-to-5' exonuclease activity of many DNA polymerases may, therefore, be disadvantageous in situations where
one is trying to achieve net synthesis of DNA.

No document discussing DNA polymerase from B. stearothermophilus relates to a DNA polymerase enzyme which
exhibits reduced 3' to 5' exonuclease activity and which can give superior results in DNA amplification techniques and
primer extension reactions. Further, prior documents do not appear to fully appreciate a genomic clone of Bacillus
stearothermophilus, such as a gene or isolated nucleic acid molecule encoding a DNA polymerase from Bacillus
stearothermophilus; Klenow-like fragments from expression thereof; or uses thereof, e.g., in manual or automated DNA
sequencing and/or labeling.

There is thus a need for a B. stearothermophilus DNA polymerase I enzyme that has reduced 3' to 5' exonuclease
activity or which is deficient in 3' to 5' exonuclease activity and can participate in various DNA amplification schemes,
including DNA sequencing and labelling, manual or automated. There is further a need for uses of the genomic clone
of Bst, e.g., uses for nucleic acid molecules which encode a DNA polymerase from Bst as well as for Klenow-like
fragments therefrom and expression thereof; for example there is a need for producing DNA polymerases from Bst,
and a need in manual or automated DNA sequencing and/or labeling for such polymerases.

OBJECTS AND SUMMARY OF THE INVENTION 

This invention provides for B. stearothermophilus DNApolI derivatives with reduced 3'-5' exonuclease activity or
which are deficient in 3' to 5' exonuclease activity. The present invention further provides nucleic acid molecules
encoding such Bst DNApolI derivatives.

This invention also provides modular DNA polymerases with defined enzymatic domains which can be used in
any combinations to create hybrid proteins with novel properties.

The present invention also provides a genomic clone of Bst which is useful for providing nucleic acid molecules
which encode a DNA polymerase from Bst, as well as for providing Klenow-like fragments therefrom and expression
products thereof. For example, this invention provides a means for producing DNA polymerases from Bst, and manual
or automated DNA sequencing and/or labeling using such polymerases.

Accordingly, the invention has many objectives, which can include one or more of: 



Providing nucleic acid molecules which encode a DNA polymerase from Bst, such as a nucleic acid molecule
which with reference to full length Bst DNApolI has deletions therein, e.g., from the 5' end; a vector such as a
recombinant containing the nucleic acid molecule, e.g., a recombinant containing and/or expressing a nucleic acid
molecule encoding a Bacillus stearothermophilus DNA polymerase I such as a truncated Bacillus stearothermophilus
DNA polymerase I (from expression of a Bst DNApolI nucleic acid molecule having deletions therein, e.g., from
the 5' end), for instance a Bst DNA polymerase I which is deficient in or has reduced 3' to 5' exonuclease activity;

Providing B. stearothermophilus DNApolI derivatives (Klenow clones) with increasing deletions from the 5' end
exhibiting reduced 3' to 5' exonuclease activity or lacking 3' to 5' exonuclease activity; and

Providing B. stearothermophilus DNA polymerase I Klenow-like fragments with reduced 3' to 5' exonuclease activity
or which are deficient in 3' to 5' exonuclease activity, that can participate in various DNA amplification schemes
including DNA sequencing and DNA labelling reactions.

The use of the expression products of DNApolI clones or of DNApolI clones (including expression products from
truncated or deleted versions of DNApolI or truncated or deleted versions of DNApolI clones) in manual or automated
DNA sequencing and labelling techniques, e.g., providing methods for DNA sequencing or labelling including a Bst
DNA polymerase from expression of an isolated nucleic acid molecule coding therefor, such as a Bst polymerase
having reduced 3' to 5' exonuclease activity or which is deficient in such activity from expression of a truncated or
deleted DNApolI.

Providing combined B. stearothermophilus DNApolI genes or portions thereof so as to express tandem or chimeric
Bst polymerases with reduced 3' to 5' exonuclease activity or which are deficient in 3' to 5' exonuclease activity, or
providing Bst DNApolI genes or portions thereof combined with DNA encoding other modular catalytic domains which
upon expression yield tandem or chimeric novel proteins.

Accordingly, the present invention provides nucleic acid molecules encoding B. stearothermophilus DNA polymerase
I enzymes and Klenow-like fragments therefrom and Bst DNA polymerase I enzymes from expression thereof,
such as Bst DNA polymerase I enzymes having improved DNA processivity by removal of some or all of the 3' to 5'
exonuclease activity of the enzyme, e.g., by deleting portions of the 5' end of the DNA polymerase gene (DNApolI).
The present invention also provides methods for making such expression products or Klenow-like fragments, e.g., by
inserting the nucleic acid molecules into a vector and isolating the expression products thereof; and, for using such
expression products or Klenow-like fragments, e.g., in DNA sequencing or labelling.

The recombinant B. stearothermophilus DNA polymerase I enzymes, especially those with reduced 3' to 5'
exonuclease activity, are not only useful in DNA synthesis and labelling reactions, but have wider applications in
nucleotide amplification schemes; for instance, they can be used with nucleotides or their analogs which otherwise
would not be incorporated because of the editing 3' to 5' exonuclease activity. Furthermore, these enzymes can be
used in automated sequencing and labelling reactions.

The present invention, in certain particular embodiments, provides a series of B. stearothermophilus DNApolI
structural genes of 1781 (PK), 1526 (P1), 1281 (P2), 1043 (P3) and 791 (P4) bp respectively, derived from the B.
stearothermophilus DNApolI gene but with increasing deletions at the 5' end (deleted clones of B. stearothermophilus
DNApolI) (2859 bp for genomic clone minus 1077, 1332, 1577, 1815 and 2067 bp, respectively; start positions re: Fig.
1 are plus one (1) bp, i.e., at 1078, 1333, 1578, 1816, and 2068, respectively). These genes encode Bst DNA polymerase
enzymes which are 3' to 5' exonuclease-deficient or which have reduced 3' to 5' exonuclease activity or which may
lack 3' to 5' exonuclease function. One can also obtain these Bst DNA polymerase enzymes which are 3' to 5'
exonuclease-deficient or which have reduced 3' to 5' exonuclease activity by insertional inactivation or other means
for interrupting expression of the full length Bst DNApolI gene such that the product ultimately expressed is the same
as or homologous to the product from the 1781 (PK), 1526 (P1), 1281 (P2), 1043 (P3) and 791 (P4) bp genes.
Accordingly, one can use 5' end deletions, or interruptions such as insertional inactivation, to cause expression of
pieces of DNApolI, such as the 1781 (PK), 1526 (P1), 1281 (P2), 1043 (P3) and 791 (P4) bp portions of DNApolI, to
thereby obtain Bst DNA polymerase enzymes which are 3' to 5' exonuclease-deficient or which have reduced 3' to 5'
exonuclease activity, or which may lack 3' to 5' exonuclease function. The Klenow fragment (Bst polIK; 593 aa
starting at aa position 286, encoded by 1781 bp fragment starting at position 1078) is proposed to contain the 3' to
5' exonuclease and DNA synthesis domains but may lack 3' to 5' exonuclease function (Stellmann et al., J. Bacteriol.
1992, 174:4350-55).
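
By way of illustration only, the relationship stated above between the number of nucleotides deleted from the 5' end and the start position of each deleted clone (start position = deleted nucleotides plus one) can be sketched as follows. The dictionary and function names are assumptions introduced for this sketch; the numbers are those given in the text.

```python
GENOMIC_CLONE_BP = 2859  # length of the genomic clone as stated in the text

# Nucleotides removed from the 5' end for each deleted clone, as listed above.
DELETED_BP = {"PK": 1077, "P1": 1332, "P2": 1577, "P3": 1815, "P4": 2067}


def start_position(deleted_bp: int, clone_bp: int = GENOMIC_CLONE_BP) -> int:
    """Return the 1-based position of the first retained nucleotide."""
    if not 0 <= deleted_bp < clone_bp:
        raise ValueError("deletion must be shorter than the clone")
    return deleted_bp + 1


# Start positions relative to the numbering of Fig. 1.
starts = {name: start_position(n) for name, n in DELETED_BP.items()}
```

With the values above, `starts` reproduces the positions 1078, 1333, 1578, 1816 and 2068 recited for PK, P1, P2, P3 and P4.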

Thus, the deleted clones give rise to a series of truncated B. stearothermophilus DNA polymerase I enzymes that
are deficient in 3'-5' exonuclease activity or have reduced 3'-5' exonuclease activity and are useful in nucleic acid
synthesis by primer extension reactions, particularly DNA cycle-sequencing reactions. Furthermore, deleted clones
may be combined with each other to give novel enzymes, or with other nucleic acid molecules encoding catalytic
domains, to give rise to chimeric enzymes with novel properties.

Thus, the present invention contemplates recombinant thermostable B. stearothermophilus DNA polymerase I
enzymes with DNA synthesis activity but deficient in 3' to 5' exonuclease activity or having reduced 3' to 5'
exonuclease activity. Preferably, the polymerase enzymes have an amino acid residue sequence represented by the
formula shown in FIG. 1 (SEQ ID NOS: 1 and 2) from residue 371 (e.g., residues 1 to 370 are deleted), or from residue
453 (e.g., residues 1 to 452 are deleted), or from residue 532 (e.g., residues 1 to 531 are deleted), or from residue
616 (e.g., residues 1 to 615 are deleted). Additionally, the invention comprehends the Bst Klenow fragment which starts
from residue 286 (e.g., residues 1 to 285 are deleted). The polymerases can culminate at residue 879.

Accordingly, the invention comprehends nucleic acid molecules encoding Bst polymerase I, such as DNA polymerase
I having reduced or no 3' to 5' exonuclease activity, or which may lack such activity, e.g., nucleic acid molecules
encoding polymerase enzymes having an amino acid residue sequence represented by the formula shown in FIG. 1 (SEQ ID
NOS: 1 and 2) from residue 286 (e.g., residues 1 to 285 are deleted), or from residue 371 (e.g., residues 1 to 370 are
deleted), or from residue 453 (e.g., residues 1 to 452 are deleted), or from residue 532 (e.g., residues 1 to 531 are
deleted), or from residue 616 (e.g., residues 1 to 615 are deleted); for instance, an isolated nucleic acid molecule
wherein nucleotides 1 to 1077 are deleted or interrupted, or an isolated nucleic acid molecule wherein nucleotides 1
to 1332 are deleted or interrupted, or an isolated nucleic acid molecule wherein nucleotides 1 to 1577 are deleted or
interrupted, or an isolated nucleic acid molecule wherein nucleotides 1 to 1815 are deleted or interrupted, or an
isolated nucleic acid molecule wherein nucleotides 1 to 2067 are deleted or interrupted; or wherein "reading" begins
at nt 1078, 1333, 1578, 1816, or 2068.
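
By way of illustration only, the convention used above, in which a polymerase "from residue N" is one in which residues 1 to N-1 are deleted, can be sketched as follows. The function names and the toy sequence are hypothetical and are not the sequence of FIG. 1.

```python
def deleted_residues(first_residue: int) -> tuple[int, int]:
    """Return the 1-based range of residues removed when starting at first_residue."""
    if first_residue < 2:
        raise ValueError("nothing is deleted when starting at residue 1")
    return (1, first_residue - 1)


def truncate_from_residue(protein: str, first_residue: int) -> str:
    """Return the polypeptide that starts at first_residue (1-based numbering)."""
    if not 1 <= first_residue <= len(protein):
        raise ValueError("first_residue is outside the sequence")
    return protein[first_residue - 1:]


# Toy 10-residue sequence (hypothetical, for illustration only).
toy = "MKTAYIAKQR"
fragment = truncate_from_residue(toy, 4)

# Start residues recited in the text, mapped to the residues they remove.
recited = {n: deleted_residues(n) for n in (286, 371, 453, 532, 616)}
```

Here `recited` reproduces the pairs given in the text, for example residues 1 to 285 for a start at residue 286.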

This invention also provides vectors such as plasmids containing a Bst gene, e.g., Bst DNApolI, or a functional
portion of Bst DNApolI which encodes a 3' to 5' exonuclease deficient B. stearothermophilus DNA polymerase or a B.
stearothermophilus DNA polymerase having reduced 3' to 5' exonuclease activity, and cells, such as prokaryotic
cells, transformed with a vector, such as a plasmid, of this invention.

Also, the invention comprehends modular DNA polymerases with defined enzymatic domains which can be
combined to create hybrid proteins with novel properties.

These and other objects and embodiments are disclosed or are obvious from and encompassed by, the following 
Detailed Description. 
The following Detailed Description, given by way of example, but not intended to limit the invention to specific
embodiments described, may be understood in conjunction with the accompanying Figures, incorporated herein by
reference, in which:



FIG. 1 shows the nucleotide (SEQ ID NO: 1) and peptide (SEQ ID NO: 2) sequences of the B. stearothermophilus
polymerase I (see Phang et al., 1995, Gene 163: 65-68; the methionine amino acid under the ATG start codon is
in bold; the 5'-end primer sequences were designed to prime onto the underlined sequences and are labelled PK,
P1, P2, P3 and P4 respectively; the 3'-end primer was the same for all clones and was based on a sequence
located on the plasmid vector pTrc99A (see FIG. 2 and Example 3); and the NcoI internal site is indicated by a
dotted line);

FIG. 2 shows a schematic diagram, not necessarily to scale, of deleted B. stearothermophilus DNApolI clones
obtained by DNA amplification using specific 5' primers labelled PK, P1, P2, P3, P4 and a 3' primer labelled P5
(clones are labelled PV8168 (PK), PV8169, PV8170, PV8171, PV8172); and,

FIG. 3 shows a schematic diagram of the cloning scheme employed to obtain the 5' deleted B. stearothermophilus 
DNApolI clones as described in the Examples. 

DETAILED DESCRIPTION 

Definitions 



The following terms are defined in order to provide a clear and consistent understanding of their use in the spec- 
ification and the claims. Other terms are well known in the art so that they need not be defined herein. 
Amino acid residues are numbered according to Phang et al., 1995, Gene 163: 65-68. 

The "amino acid residues" described herein are preferred to be in the "L" isomeric form. However, residues in
the "D" isomeric form can be substituted for any amino acid residue, as long as the desired functional property is
retained by the polypeptide. NH2 refers to the free amino group present at the amino terminus of a polypeptide. COOH
refers to the free carboxy group present at the carboxy terminus of a polypeptide. The amino-terminal NH2 group and
carboxy-terminal COOH group of free polypeptides are typically not set forth in a formula. A hyphen at the amino- or
carboxy-terminus of a sequence indicates the presence of a further sequence of amino acid residues or a respective
NH2 or COOH terminal group.

In keeping with standard polypeptide nomenclature, J. Biol. Chem., 243: 3552-3559 (1969) and adopted at 37
CFR §1.822(b)(2), abbreviations for amino acid residues are shown in the following Table of Correspondence:



Table of Correspondence

Symbol          Amino acid
Y     Tyr       tyrosine
G     Gly       glycine
F     Phe       phenylalanine
M     Met       methionine
A     Ala       alanine
S     Ser       serine
I     Ile       isoleucine
L     Leu       leucine
T     Thr       threonine
V     Val       valine
P     Pro       proline
K     Lys       lysine
H     His       histidine
Q     Gln       glutamine
E     Glu       glutamic acid
W     Trp       tryptophan
R     Arg       arginine
D     Asp       aspartic acid
N     Asn       asparagine
C     Cys       cysteine



It should be noted that all amino acid residue sequences are represented herein by formulae whose left and right
orientation is in the conventional direction of amino-terminus to carboxy-terminus.

An "analog" of a molecule such as B. stearothermophilus DNApolI is meant to refer to a molecule substantially
similar in function to either the entire molecule or to a fragment thereof.

The term "base pair (bp)" relates to a partnership of adenine (A) with thymine (T), or of cytosine (C) with guanine 
(G) in a double-stranded DNA molecule. 

As used herein, "cell", "cell line", and "cell culture" can be used interchangeably and all such designations include
progeny. Thus, the words "transformants" or "transformed cells" include the primary subject cell and cultures derived
therefrom without regard for the number of transfers. It is also understood that all progeny may not be precisely identical
in DNA content, due to deliberate or inadvertent mutations. Mutant progeny that have the same functionality as
screened for in the originally transformed cell are included.

As used herein, a molecule is said to be a "chemical derivative" of another molecule when it contains additional 
chemical moieties not normally a part of the molecule. Such moieties may improve the molecule's solubility, absorption, 
biological half life, etc. 

"Clone" refers to any entity capable of replicating itself. As used herein, it refers to both the structural gene as well 
as to the replicated protein. 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably linked coding
sequence in a particular host organism. The control sequences that are suitable for prokaryotes, for example, include
a promoter, optionally an operator sequence, a ribosome binding site, and the like. Eukaryotic cells are known to utilize
promoters, polyadenylation signals, and enhancers.

A "DNA polymerase substantially reduced in 3'-5' exonuclease activity" is defined herein as either (1) a
mutated DNA polymerase that has about or less than the specific activity of the corresponding unmutated, wild-type
enzyme, or (2) a DNA polymerase having a 3' to 5' exonuclease specific activity less than about 1 unit/mg protein, or
preferably about or less than 0.1 units/mg protein. A unit of activity of 3' to 5' exonuclease is defined as the amount of
activity that solubilizes 10 nmoles of substrate ends in 60 min. at 37°C, assayed as described in the "BRL 1989
Catalogue & Reference Guide", page 5, with HhaI fragments of lambda DNA 3'-end labeled with [3H]dTTP by terminal
deoxynucleotidyl transferase (TdT). Protein concentration is measured by the method of Bradford, 1976, Anal.
Biochem. 72: 248.
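
By way of illustration only, the unit definition and criterion (2) above can be expressed as a short calculation. The function names, the assumption of a linear time course when the assay is not run for exactly 60 minutes, and the use of exact thresholds in place of "about" are assumptions of this sketch and are not part of the assay reference.

```python
NMOL_PER_UNIT = 10.0   # nmoles of substrate ends solubilized by one unit
ASSAY_MINUTES = 60.0   # assay duration in the unit definition (at 37 deg C)


def exonuclease_units(nmol_solubilized: float, minutes: float = ASSAY_MINUTES) -> float:
    """Units of 3' to 5' exonuclease activity, assuming a linear time course."""
    if minutes <= 0:
        raise ValueError("assay time must be positive")
    return (nmol_solubilized / NMOL_PER_UNIT) * (ASSAY_MINUTES / minutes)


def specific_activity(units: float, protein_mg: float) -> float:
    """Specific activity in units per mg of protein."""
    if protein_mg <= 0:
        raise ValueError("protein amount must be positive")
    return units / protein_mg


def classify(spec_act: float) -> str:
    """Apply criterion (2) with exact cut-offs of 1 and 0.1 unit/mg (assumed)."""
    if spec_act <= 0.1:
        return "substantially reduced (preferred range)"
    if spec_act < 1.0:
        return "substantially reduced"
    return "not substantially reduced under criterion (2)"


# Hypothetical measurement: 5 nmoles solubilized in 60 min by 2 mg of protein.
example = specific_activity(exonuclease_units(5.0), 2.0)
label = classify(example)
```

In this hypothetical case the preparation has 0.5 unit and a specific activity of 0.25 unit/mg, which falls under criterion (2) but outside the preferred range.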

"Expression" is the process by which a promoter/structural gene produces a polypeptide. It involves transcription 
of the gene into messenger RNA (mRNA) and the translation of such mRNA into a polypeptide. 

The term "expression system" refers to DNA sequences containing a desired coding sequence and control
sequences in operable linkage, so that hosts transformed with these sequences are capable of producing the encoded
proteins. In order to effect transformation, the expression system may be included on a vector; however, the relevant
DNA may then also be integrated into the host chromosome.

"3' to 5' exonuclease activity" is an enzymatic activity well known to the art. This activity is often associated with
DNA polymerases, and is thought to be involved in a DNA replication "editing" or correction mechanism.

A "fragment" of a molecule such as B. stearothermophilus DNApolI is meant to refer to any polypeptide subset
of the molecule.

A "functional derivative" of B. stearothermophilus DNApoll is a compound which possesses a biological activity
(either functional or structural) that is substantially similar to a biological activity of B. stearothermophilus DNApoll. The
term "functional derivative" is intended to include the "fragments," "variants," "analogues," or "chemical derivatives" of
a molecule.

"Gene" is a DNA sequence that contains information necessary to express a polypeptide or protein. The term 
"gene" as used herein refers to a DNA sequence that encodes a polypeptide. 

"Heterologous" refers to two DNA segments having different origins, i.e. not, in nature, being genetically or phys- 
ically linked to each other. Heterologous also describes molecules that are in nature physically or genetically linked 
together but which are linked together in a substantially different way than is found in nature. 

"Homology", as used herein, refers to the comparison of two different nucleic acid sequences. For the present 
purposes, assessment of homology is as a percentage of identical bases, not including gaps introduced into the se- 
quence to achieve good alignment. Percent homology may be estimated by nucleic acid hybridization techniques, as 
well as by determining and comparing the exact base order of the two sequences. 
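
As a hedged illustration of the base-comparison approach, the sketch below computes percent homology for two sequences that have already been aligned. It takes one reading of "not including gaps", namely that any aligned position containing a gap character in either sequence is left out of the comparison; the function name and the use of "-" as the gap symbol are assumptions made for the example.

```python
# Illustrative sketch; the function name and gap symbol are assumptions.

def percent_homology(aligned_a, aligned_b, gap="-"):
    """Percentage of identical bases over aligned positions that contain no gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # gap introduced for alignment: not counted
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no gap-free positions to compare")
    return 100.0 * identical / compared


# Eight aligned columns, one of which holds a gap; six of the remaining seven match.
print(round(percent_homology("ATGC-GTA", "ATGCAGTT"), 1))
```

The alignment itself is assumed to have been produced beforehand by any suitable method; this sketch only scores it.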

"Host" is any prokaryotic or eukaryotic microorganism that is the recipient of a DNA molecule. The DNA molecule
may contain, but is not limited to, a structural gene, a promoter and/or an origin of replication.

"Mutation" is any change that alters the DNA sequence. As used herein, a mutated sequence may have single
or multiple changes that alter the nucleotide sequence of the DNA. Alterations of the DNA sequence include deletions
(loss of one or more nucleotides in the DNA sequence) and/or substitutions (substituting a different nucleotide for the
original nucleotide along the DNA sequence).

"Nucleotide" is a monomeric unit of DNA or RNA consisting of a sugar moiety (pentose), a phosphate, and a
nitrogenous heterocyclic base. The base is linked to the sugar moiety via the glycosidic carbon (1' carbon of the pentose)
and that combination of base and sugar is a nucleoside. When the nucleoside contains a phosphate group bonded
to the 3' or 5' position of the pentose, it is referred to as a nucleotide. A sequence of operatively linked nucleotides is
typically referred to herein as a "base sequence" or "nucleotide sequence", and is represented herein by a formula
whose left to right orientation is in the conventional direction of 5' terminus to 3' terminus. Each nucleotide is
characterized by its base. The four DNA bases are adenine (A), guanine (G), cytosine (C), and thymine (T). The four RNA
bases are A, G, C and uracil (U).

The term "oligonucleotide" as used herein is defined as a molecule composed of two or more deoxyribonucleotides
and/or ribonucleotides. Its exact size will depend on many factors, which in turn depend on the ultimate function
or use of the oligonucleotide. The oligonucleotide may be derived synthetically or by cloning.

"Operably linked" refers to juxtaposition such that the normal function of the components can be performed. Thus,
a coding sequence "operably linked" to control sequences refers to a configuration wherein the coding sequences can
be expressed under the direction of the control sequences.

"Origin of replication" refers to a DNA sequence from which DNA replication is begun, thereby allowing the DNA 
molecules which contain said origin to be maintained in a host, i.e., replicate autonomously in a host cell. 

The term "primer" as used herein refers to an oligonucleotide, whether occurring naturally or produced synthetically,
which is capable of acting as a point of initiation of nucleic acid synthesis when placed under conditions in which
synthesis of a primer extension product which is complementary to a nucleic acid strand is induced, i.e., in the presence
of four different nucleotide triphosphates and a polymerase enzyme in an appropriate buffer.

The primer is preferably single-stranded for maximum efficiency in DNA amplification, but may alternatively be
double-stranded. If double-stranded, the primer is first treated to separate its strands before being used to prepare
extension products. Preferably, the primer is an oligodeoxyribonucleotide. The primer must be sufficiently long to prime
the synthesis of extension products in the presence of a polymerase enzyme. The exact lengths of the primers will
depend on many factors, including temperature, source of primer and use of the method. For example, depending on
the complexity of the target sequence, the oligonucleotide primer typically contains 15-25 nucleotides although it may
contain more or fewer nucleotides. Short primer molecules generally require colder temperatures to form sufficiently
stable hybrid complexes with the template.

The primers herein are selected to be "substantially" complementary to the different strands of each specific
sequence to be amplified. This means that the primers must be sufficiently complementary to hybridize with their
respective strands. Therefore, the primer sequence need not reflect the exact sequence of the template. For example, a
non-complementary nucleotide fragment may be attached to the 5' end of the primer, with the remainder of the primer
sequence being complementary to the strand. Alternatively, non-complementary bases or longer sequences can be
interspersed into the primer, provided that the primer sequence has sufficient complementarity with the sequence of
the strand to be amplified to hybridize therewith and thereby form a template for synthesis of the extension product of
the primer. However, for detection purposes, particularly using labeled sequence-specific probes, the primers typically
have exact complementarity to obtain the best results.

Generally, the synthesis will be initiated at the 3' end of each primer and will proceed in the 5' direction along the
template strand, until synthesis terminates, producing molecules of different lengths.
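
The direction of primer extension described above can be illustrated with a small sketch. It is a simplification under stated assumptions: the primer is taken to be exactly complementary to the template (the "substantially complementary" case with mismatches or a 5' tail is not modeled), extension is assumed to run to the 5' end of the template, and the function names and example sequences are hypothetical.

```python
# Illustrative sketch; names and sequences are hypothetical.

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq_5to3):
    """Return the complementary strand, also written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(seq_5to3.upper()))


def extend_primer(template_5to3, primer_5to3):
    """Full-length extension product (5' to 3') of a primer annealed to the template.

    New bases are added at the primer's 3' end while the polymerase moves
    toward the 5' end of the template strand.
    """
    template = template_5to3.upper()
    primer = primer_5to3.upper()
    site = reverse_complement(primer)  # template stretch that pairs with the primer
    start = template.find(site)
    if start < 0:
        raise ValueError("primer does not anneal to this template")
    upstream = template[:start]        # template bases 5' of the annealing site
    return primer + reverse_complement(upstream)


template = "ATGGCATTCAGGCTA"
primer = "TAGCCT"                      # pairs with the 3'-terminal AGGCTA of the template
print(extend_primer(template, primer))
```

Because this primer anneals at the very 3' end of the template, a full-length product equals the reverse complement of the whole template; products that terminate early would be prefixes of that sequence.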

"Processive" is a term of art referring to an enzyme's property of acting to synthesize or hydrolyse a polymer 
without dissociating from the particular polymer molecule. For the purposes of the present invention, processive refers 
to enzymes that add, on the average, at least 100, and preferably, about 200 or more, nucleotides before dissociation. 

"Promoter" is a term of art referring to sequences necessary for transcription. It does not include ribosome binding 
sites and other sequences primarily involved in translation.

"Purifying" refers to increasing the specific activity of an enzymatic activity over the level produced in a culture in 
terms of units of activity per weight of protein. This term does not imply that a protein is purified to homogeneity.

"Structural gene" is a DNA sequence that is transcribed into messenger RNA that is then translated into a se- 
quence of amino acid residues characteristic of a specific polypeptide. A "structural gene" may contain a heterologous 
ribosome binding site, replacing its natural (homologous) ribosome binding site as well as additional sequences to
allow its rapid cloning into suitable vectors. 

"Substantially pure" means that the desired purified enzyme is essentially free from contaminating cellular com- 
ponents which are associated with the desired enzyme in nature. Contaminating cellular components may include, but 
are not limited to, phosphatases, exonucleases or endonucleases.

A molecule is said to be "substantially similar" to another molecule if both molecules have substantially similar 
structures or if both molecules possess a similar biological activity. Thus, provided that two molecules possess a similar 
activity, they are considered variants as that term is used herein even if the structure of one of the molecules is not 
found in the other, or if the sequence of amino acid residues is not identical.

A "variant" of a molecule such as B. stearothermophilus DNApoll is meant to refer to a molecule substantially
similar in structure and function to either the entire molecule, or to a fragment thereof.

B. stearothermophilus DNA pol I genes, including truncated versions

The B. stearothermophilus DNApoll gene has been cloned and sequenced (see Phang et al., 1995, Gene 163:
65-68). The nucleotide and amino acid sequences are provided in Figure 1 and as SEQ ID NOS: 1 and 2, respectively.
The recombinant enzyme exhibits similar DNA sequencing properties to the native (wild type) enzyme.

Various properties of the Bst polymerase enzyme have been studied. For example, Hong et al., EP 0 712 927 A2,
is directed particularly to a Bst enzyme possessing 3' to 5' exonuclease activity.

Kong, EP 0 757 100 A1, relates specifically to an isolated DNA molecule consisting of nucleotides 868 to 2631 of
Kong's SEQ ID No: 1, as well as a substantially pure recombinant Bst DNA polymerase substantially free of 3' to 5'
exonuclease activity expressed by that isolated DNA molecule. The Kong clone has only 80% homology to that of the
polymerase clones of the present invention. Therefore, it is likely that Kong's clone is DNA polymerase III, not DNA
polymerase I, as in the present invention.

Riggs et al., Biochimica et Biophysica Acta, 1307 (1996) 178-186 relates to a full-length Bst clone that appears to
have 98% homology with that of Phang et al. (see Aliotta et al., Genetic Analysis: Biomolecular Engineering 12 (1996)
185-95). However, Riggs' clone is from a different Bst strain. Further, Riggs has a two (2) amino acid break within their
Klenow-like fragment and approximately 28 additional amino acids at their NH2-terminal end (with respect to the longest
exemplified polymerase of the invention, namely Bst pollK, the Bst Klenow fragment which starts from aa 286, such
that as to other exemplified polymerases Riggs has even more additional amino acids). More particularly, Riggs
removed 5'-3' exonuclease activity by base mutations, while retaining the full-length sequence, and Riggs also removed
5'-3' exonuclease activity to create a specific deleted Klenow-like fragment.

In contrast, in the present invention 5'-3' exonuclease activity is removed or reduced by defining a specific deleted
Klenow-like clone, Bst pollK, which is 28 amino acids shorter at the NH2-terminal end than Riggs' deleted Klenow-like
fragment. Further, the Bst pollK has a start site which coincides with the NH2-terminal end of the proposed Bst Klenow
fragment following subtilisin digest of Bst DNA Poll to yield a Klenow fragment, and is from expression of a clone with
two base mutations which were necessary to obtain translation of the deleted Klenow-like clone. Accordingly, the Bst
Klenow fragment (from aa 286) is significantly different from Riggs' fragment.

While Riggs may disclose deletions in the 5'-3' exonuclease domain of the enzyme and/or its corresponding gene,
and asserts that 3'-5' exonuclease activity is a "desired characteristic", Riggs does not describe or suggest DNA
encoding a Bst polymerase Klenow-like fragment or a Bst polymerase Klenow-like fragment lacking 3'-5' exonuclease
activity, as in the present invention. Note again that, as to the other exemplified embodiments, e.g., those beginning at aa
371, 453, 532 or 616, Riggs' fragment is longer still, and therefore even more significantly different.

This invention in especially advantageous embodiments relates to 3' to 5' exonuclease-deficient Bacillus
stearothermophilus (Bst) DNA polymerase, e.g., from deleted or interrupted versions of the DNApoll gene, e.g., versions
having deletions at the 5' end. These deleted clones code for truncated B. stearothermophilus DNA polymerase
enzymes that are deficient in 3' to 5' exonuclease activity. These truncated enzymes are extremely useful in nucleic acid
synthesis by primer extension reactions, particularly DNA sequencing and labelling reactions.

In order to obtain these deleted clones, a fragment of the DNA polymerase I-encoding gene from B.
stearothermophilus was obtained by PCR. This was then used as a probe to obtain a full-length gene from a Bst genomic DNA
(gDNA) plasmid library. A protein was produced from this gene which gave DNA polymerase activity similar to wild-type
subtilisin-treated Bst DNA polymerase. A comparison of its amino acid homology allowed an assignment of its
sequences corresponding to the enzyme domains of E. coli DNApoll. The 3'-5' exonuclease activity is proposed to be
located in a domain within the B. stearothermophilus DNApoll gene.

The 3' to 5' exonuclease activity of the wild type B. stearothermophilus DNA polymerase I was made deficient by
selective deletion of amino acid residues required for the 3' to 5' exonuclease activity without inhibiting the DNA
synthesis activity. The wild type Bst nucleic acid sequence was mutated at the 5' end of the gene. The mutated sequence
may have a single change or multiple changes that alter the nucleotide sequence of the gene. Alterations of the DNA
sequence include deletions (loss of one or more nucleotides in the DNA sequence) and/or substitutions (substituting
a different nucleotide for the original nucleotide along the DNA sequence).

The truncated B. stearothermophilus DNA polymerase I of this invention is preferably a recombinant protein derived
from the expression of the gene for DNA polymerase from B. stearothermophilus; but, one can also obtain a truncated
DNA polymerase I by cleaving portions of the polymerase natively expressed by Bst. The truncated B.
stearothermophilus DNA polymerase I has an amino acid residue sequence that substantially corresponds to the amino acid
residue sequence of the native B. stearothermophilus DNA polymerase I enzyme, with the exception of the removal
or substitution of one or more amino acid residues so as to reduce or eliminate the 3'-5' exonuclease activity and thereby
render the polymerase substantially deficient in the 3'-5' exonuclease activity. Any of a number of amino acid removals
or substitutions in the subject polymerase are contemplated, so long as the requisite activity is produced. All of the
purified enzymes exhibit DNA synthesis activity, but are deficient in 3' to 5' exonuclease activity.

The preparation of recombinant B. stearothermophilus DNApoll clones with deletions from the 5' end is further 
detailed in the Examples. 

Specific Truncated Clones 

Certain preferred embodiments of this invention relate specifically to 3' to 5' exonuclease-deficient B.
stearothermophilus DNA polymerase I enzymes and the structural genes therefor of 1781 (PK), 1526 (P1), 1281 (P2), 1043 (P3)
and 791 (P4) bp, respectively; and, methods of making and using the enzymes and the genes. These clones have
increasing deletions from the 5' end of the DNApoll gene and give rise to a series of truncated B. stearothermophilus
DNA polymerase I enzymes that are deficient in 3' to 5' exonuclease activity. Using SDS-PAGE gel electrophoresis
under denaturing conditions, these monomeric enzymes were determined to have molecular weights of approximately
65.6, 56.1, 47.1, 38.4 and 29.2 kDa, respectively.

The amino acid residue sequences of these truncated B. stearothermophilus DNA polymerase I enzymes are
included in FIG. 1, except that amino acid removal or substitutions are present which confer 3' to 5' exonuclease deficiency
as described herein. Preferably, these preferred polymerase enzymes have an amino acid residue sequence
represented by the formula shown in FIG. 1 from residue 286 (e.g., residues 1 to 285 are deleted) (Klenow - PV8168), or
from residue 371 (e.g., residues 1 to 370 are deleted) (PV8169), or from residue 453 (e.g., residues 1 to 452 are
deleted) (PV8170), or from residue 532 (e.g., residues 1 to 531 are deleted) (PV8171), or from residue 616 (e.g.,
residues 1 to 615 are deleted) (PV8172). The polymerases can culminate at residue 879 (SEQ ID NO: 3, 4, 5, 6 and
7, respectively).
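
As a rough cross-check of the residue ranges above against the molecular weights reported for these enzymes (approximately 65.6, 56.1, 47.1, 38.4 and 29.2 kDa), the following sketch counts the residues retained in each truncation and multiplies by an average residue mass. The figure of 110 Da per residue is a general rule of thumb and an assumption of this example, not a value from this description; the function names are hypothetical.

```python
# Illustrative sketch; 110 Da per residue is a rule-of-thumb assumption.

LAST_RESIDUE = 879           # the polymerases can culminate at residue 879
AVG_RESIDUE_MASS_DA = 110.0  # assumed average mass of one amino acid residue

# clone: (first retained residue, reported molecular weight in kDa)
CLONES = {
    "PV8168": (286, 65.6),
    "PV8169": (371, 56.1),
    "PV8170": (453, 47.1),
    "PV8171": (532, 38.4),
    "PV8172": (616, 29.2),
}


def residue_count(first_residue, last_residue=LAST_RESIDUE):
    """Number of residues from first_residue through last_residue, inclusive."""
    return last_residue - first_residue + 1


def approx_mass_kda(n_residues):
    """Approximate polypeptide mass in kDa from the residue count."""
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


for clone, (first, reported_kda) in CLONES.items():
    n = residue_count(first)
    print(clone, n, round(approx_mass_kda(n), 1), reported_kda)
```

Under this assumption the estimates fall within a few tenths of a kDa of the reported values; an exact mass would require the actual amino acid composition of each polypeptide.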

Likewise, the preferred polymerases can be from expression of a Bst DNApoll gene which has had deletion,
mutation, or insertional or other inactivation at the 5' end such that nucleotides from 1078 are expressed (e.g., nucleotides
1 to 1077 are deleted) or, such that nucleotides from 1333 are expressed (e.g., nucleotides 1 to 1332 are deleted), or
such that nucleotides from 1578 are expressed (e.g., nucleotides 1 to 1577 are deleted), or such that nucleotides from
1816 are expressed (e.g., nucleotides 1 to 1815 are deleted), or such that nucleotides from 2068 are expressed (e.g.,
nucleotides 1 to 2067 are deleted) (SEQ ID NO: 8, 9, 10, 11 and 12, respectively).
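
The relationship between the residue numbers and the nucleotide numbers given for the truncated clones can be checked arithmetically. The sketch below assumes that residue 1 is encoded beginning at nucleotide 223 of FIG. 1 and that the coding sequence ends at nucleotide 2859, as stated elsewhere in this description; the function name is hypothetical. It reproduces four of the five stated start positions. For residue 453 the formula gives 1579 rather than the stated 1578, and this sketch does not attempt to resolve which value is intended. The stated gene sizes (1781, 1526, 1281, 1043 and 791 bp) each equal 2859 minus the stated start position.

```python
# Illustrative sketch; coordinates are those stated in this description.

CDS_FIRST_NT = 223   # first coding nucleotide in FIG. 1
CDS_LAST_NT = 2859   # last coding nucleotide in FIG. 1

# first retained residue: (stated first nucleotide, stated gene size in bp)
STATED = {
    286: (1078, 1781),
    371: (1333, 1526),
    453: (1578, 1281),
    532: (1816, 1043),
    616: (2068, 791),
}


def first_nucleotide(first_residue):
    """Nucleotide position of the first base of the codon for first_residue."""
    return CDS_FIRST_NT + 3 * (first_residue - 1)


for residue, (stated_nt, stated_bp) in STATED.items():
    computed_nt = first_nucleotide(residue)
    flag = "" if computed_nt == stated_nt else "  <- differs from stated value"
    print(residue, computed_nt, stated_nt, CDS_LAST_NT - stated_nt, stated_bp, flag)
```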

The amino acid residue sequence of a truncated B. stearothermophilus DNA polymerase I can be determined by
any suitable method, such as by automated Edman degradation, and the like.

The deleted structural genes encode truncated proteins which retain the DNA synthesis domain but exhibit
varying degrees of 3' to 5' exonuclease activity. The structural genes, comprising the clones PV8168, PV8169, PV8170,
PV8171 and PV8172, may be derived from the B. stearothermophilus DNApoll structural gene, and are exemplified herein
by the B. stearothermophilus DNApoll structural gene (see FIGS. 1 & 2). The DNA comprising the B. stearothermophilus
DNApoll structural gene may be derived from genomic DNA, cDNA, synthetic DNA and combinations thereof.

The polymerases encoded by these truncated genes have at least about 75% homology with B. stearothermophilus 
DNA polymerase I, preferably at least about 90% homology, again provided that the proteins when expressed have B. 
stearothermophilus DNA polymerase I activity. Specific molecules exemplified herein include the proteins derived from 
the clones PV8168, PV8169, PV8170, PV8171 and PV8172, and functional derivatives thereof. A functional derivative 
of a DNA molecule is derived from the original DNA molecule but still may express the desired structural gene in a
host or in vitro, i.e., express a gene encoding B. stearothermophilus DNA polymerase I activity. 

These recombinant DNA molecules each have: a structural gene encoding a protein which has a processive DNA 
polymerase activity; a promoter heterologous to the structural gene; and an origin of replication heterologous to the 
structural gene. In this combination, the promoter and the structural gene are in such position and orientation with 
respect to each other that the structural gene may be expressed in a host cell under control of the promoter, and the
origin of replication is capable of maintaining the promoter/structural gene/origin of replication combination in a host cell. 

Preferably, the promoter and the origin of replication are functional in the same host cell, exemplified herein by an 
E. coli host cell. The DNA molecule is preferably contained by a host cell, exemplified herein by an E. coli host cell 
(strains JM105, XL1-Blue, BL21(DE3), BL21(DE3)pLysS and BL21(DE3)pLysE), but may, of course, exist in vitro. The 
promoter may be inducible, e.g. the tac promoter or the T7 promoter. The protein may also have a 3'-5' DNA exonuclease
activity or may have substantially reduced 3'-5' exonuclease activity. Preferably, the structural gene is not under control 
of a homologous promoter. In the preferred example, the structural gene is under the control of a heterologous ribosome 
binding site, not a homologous ribosome-binding site.

Given the herein disclosed teachings, further detailed in the Examples, one of ordinary skill in the art can use
standard recombinant DNA techniques to express B. stearothermophilus DNApoll or its derivatives in E. coli.

Other promoters, vectors, and host cells, both prokaryotic and eukaryotic, are well known in the art and in keeping 
with the specification, may be used to practice the invention. 

The present invention specifically includes B. stearothermophilus DNApoll derivatives. These include genes and
functional derivatives, the latter with or without enzymatic activity. A "functional derivative" of B. stearothermophilus
DNApoll is a compound which possesses a biological activity (either functional or structural) that is substantially similar
to a biological activity of Bst DNApoll. The term functional derivatives includes fragments, variants, analogues, or
chemical derivatives of a molecule. Thus, provided that two molecules possess a similar activity, they are considered variants
even if the structure of one of the molecules is not found in the other, or if the sequence of amino acid residues is not
identical.

Production of Recombinant Truncated B. Stearothermophilus DNApoll Proteins (Cloning) 

Cloning involves the use of recombinant DNA technology to propagate DNA fragments inside a foreign host. The
fragments are usually isolated from chromosomes using restriction enzymes and then united with a carrier such as a
vector. Following introduction into suitable host cells, the DNA fragments can then be reproduced along with the host
cell DNA. Cloning provides unlimited material for experimental study.

The present invention also provides a method for production of a protein having a processive DNA polymerase
activity from a host cell containing a recombinant DNA molecule under conditions where the structural gene is
expressed, followed by purifying the protein expressed during the culturing step.

Recombinant B. stearothermophilus DNApoll proteins are typically produced by recombinant DNA
techniques, from a gene encoding the modified enzyme. Thus, the present invention also contemplates a DNA segment
consisting essentially of a sequence of nucleotide bases encoding a truncated B. stearothermophilus DNApoll of this
invention.

Of course, modifications to the primary structure itself by deletion, addition, or alteration of the amino acids incor- 
porated into the protein sequence during translation can be made without destroying the functionality of the protein. 
Such substitutions or other alterations result in proteins having an amino acid sequence encoded by DNA falling within 
the contemplated scope of the present invention. 

Mutation can be accomplished by a variety of methods well known in the art that are not critical to the invention.
Mutation methods include, but are not limited to, site-directed mutation, random mutation, error-prone PCR, in vivo
mutagenesis using oligonucleotides, and the like. A preferred mutation procedure is described herein in the
Examples. The resulting mutated gene encodes a recombinant truncated B. stearothermophilus DNApoll. The mutated
gene is then provided in the form of a DNA segment (excised or recovered coding sequence) that encodes the
recombinant B. stearothermophilus DNApoll protein using standard techniques.

The invention also contemplates DNA segments that encode truncated B. stearothermophilus DNApoll proteins
of this invention. DNA molecules, such as plasmids and bacteriophage genomes containing a DNA segment, are also
a part of this invention. Host cells containing a plasmid, DNA segment, DNA molecule or bacteriophage genome of
this invention are also contemplated. Preferably, a truncated B. stearothermophilus DNApoll-encoding DNA of this
invention is in the form of a plasmid, cosmid or phage. A preferred DNA segment containing a gene that encodes a
truncated B. stearothermophilus DNApoll is present on the plasmids PV8168, PV8169, PV8170, PV8171 and PV8172.

A preferred recombinant DNA molecule includes a nucleotide sequence shown in FIG. 1 from nucleotide base 223 
to base 2859, but having various nucleotide deletions from the 5'-end according to the present invention to provide the 
required 3'-5' exonuclease deficiency in the coded B. stearothermophilus DNApoll. 

A DNA segment defining a truncated B. stearothermophilus DNApoll of this invention can be prepared by a variety
of molecular biological techniques. The precise nucleotide sequence is not critical so long as the resulting encoded
protein has the requisite properties, and preferably encodes a preferred amino acid residue sequence as described
herein.

In one approach, a DNA segment can be assembled by the systematic hybridization and ligation of synthetic
oligonucleotides to form the complete DNA segment. Synthetic oligonucleotides may be prepared by a variety of
chemical synthetic means, such as using the triester method of Matteucci et al., 1981, J. Am. Chem. Soc. 103: 3185-3191,
or using automated synthesis methods.

A recombinant DNA molecule of the present invention can be produced by operatively linking a vector to a DNA
segment of the present invention.

As used herein, the term "vector" refers to a nucleic acid molecule capable of transporting between different genetic
environments another nucleic acid to which it has been operatively linked. Preferred vectors are those capable of
autonomous replication and/or expression of nucleic acids to which they are operatively linked; vectors capable of such
expression are referred to herein as "expression vectors". As used herein, the term "operatively linked", in reference to
DNA segments, describes that the nucleotide sequence is joined to the vector so that the sequence is under the
transcriptional and translational control of the expression vector and can be expressed in a suitable host cell.

As is well known in the art, the choice of vector to which a protein encoding DNA segment of the present invention
is operatively linked depends upon the functional properties desired, e.g., protein expression, and upon the host cell
to be transformed. These limitations are inherent in the art of constructing recombinant DNA molecules. However, a
vector contemplated by the present invention is at least capable of directing the replication, and preferably also
expression, of a gene operatively linked to the vector.

The methods for making a vector or recombinant can be by or analogous to the methods disclosed in U.S. Patent
Nos. 4,603,112, 4,769,330, 5,174,993, 5,505,941, 5,338,683, 5,494,807, and 4,722,848, WO 95/30018, Paoletti,
"Applications of pox virus vectors to vaccination: An update," PNAS USA 93:11349-11353, October 1996, Moss,
"Genetically engineered poxviruses for recombinant gene expression, vaccination, and safety," PNAS USA 93:11341-11348,
October 1996, Smith et al., U.S. Patent No. 4,745,051 (recombinant baculovirus), Richardson, C.D. (Editor), Methods
in Molecular Biology 39, "Baculovirus Expression Protocols" (1995 Humana Press Inc.), Smith et al., "Production of
Human Beta Interferon in Insect Cells Infected with a Baculovirus Expression Vector," Molecular and Cellular Biology,
Dec., 1983, Vol. 3, No. 12, p. 2156-2165; Pennock et al., "Strong and Regulated Expression of Escherichia coli
β-Galactosidase in Insect Cells with a Baculovirus vector," Molecular and Cellular Biology, Mar. 1984, Vol. 4, No. 3, p.
399-406; EPA 0 370 573, U.S. application Serial No. 920,197, filed October 16, 1986, EP Patent publication No. 265785,
U.S. Patent No. 4,769,331 (recombinant herpesvirus), Roizman, "The function of herpes simplex virus genes: A primer
for genetic engineering of novel vectors," PNAS USA 93:11307-11312, October 1996, Andreansky et al., "The
application of genetically engineered herpes simplex viruses to the treatment of experimental brain tumors," PNAS USA
93:11313-11318, October 1996, Robertson et al., "Epstein-Barr virus vectors for gene delivery to B lymphocytes," PNAS
USA 93:11334-11340, October 1996, Frolov et al., "Alphavirus-based expression vectors: Strategies and applications,"
PNAS USA 93:11371-11377, October 1996, Kitson et al., J. Virol. 65, 3068-3075, 1991; U.S. Patent Nos. 5,591,439,
5,552,143, Grunhaus et al., 1992, "Adenovirus as cloning vectors," Seminars in Virology (Vol. 3) p. 237-52, 1993,
Ballay et al., EMBO Journal, vol. 4, p. 3861-65, Graham, Tibtech 8, 85-87, April, 1990, Prevec et al., J. Gen Virol. 70,
429-434, PCT WO91/11525, Felgner et al. (1994), J. Biol. Chem. 269, 2550-2561, Science, 259:1745-49, 1993 and
McClements et al., "Immunization with DNA vaccines encoding glycoprotein D or glycoprotein B, alone or in
combination, induces protective immunity in animal models of herpes simplex virus-2 disease," PNAS USA 93:11414-11420,
October 1996, and U.S. Patents Nos. 5,591,639, 5,589,466, and 5,580,859 relating to DNA expression vectors, inter
alia. That is, the techniques of the herein cited documents can be used to express Bst DNA polymerase I, either whole
or truncated, in vector systems of those documents, using the teachings herein and the knowledge in the art (e.g., as
shown by documents cited herein), without any undue experimentation.

In preferred embodiments, a vector contemplated by the present invention includes a prokaryotic replicon, i.e., a
DNA sequence having the ability to direct autonomous replication and maintenance of the recombinant DNA molecule
extrachromosomally in a prokaryotic host cell, such as a bacterial host cell, transformed therewith. Such replicons are
well known in the art. In addition, those embodiments that include a prokaryotic replicon may also include a gene whose
expression confers a selective advantage such as amino acid nutrient dependency or drug resistance to a bacterial
host transformed therewith as is well known, in order to allow selection of transformed clones. Typical bacterial drug
resistance genes are those that confer resistance to ampicillin, tetracycline, or kanamycin.

Those vectors that include a prokaryotic replicon may also include a prokaryotic promoter capable of directing the
expression (transcription and translation) of the gene transformed therewith. A promoter is an expression control
element formed by a DNA sequence that permits binding of RNA polymerase and transcription to occur. Promoter
sequences compatible with bacterial hosts are typically provided in plasmid vectors containing convenient restriction sites
for insertion of a DNA segment of the present invention. Bacterial expression systems, and choice and use of vectors
in those systems, are described in "Gene Expression Technology", Meth. Enzymol., Vol. 185, Goeddel, Ed., Academic
Press, N.Y. (1990) and Shatzman, A.R., 1990, Current Opinion in Biotech. 1: 5-11.

Expression vectors compatible with eukaryotic cells, preferably those compatible with vertebrate cells, can also
be used to form the recombinant DNA molecules of the present invention. Eukaryotic cell expression vectors are well
known in the art and are available from several commercial sources. Typically such vectors contain convenient
restriction sites for insertion of the desired gene. Typical vectors are pSVL and pKSV-10 (Pharmacia), pBPV-1/pML2d
(International Biotechnologies, Inc.), and pTDT1 (ATCC, #31255).

In preferred embodiments, the eukaryotic cell expression vectors used to construct the recombinant DNA
molecules of the present invention include a selectable phenotypic marker that is effective in a eukaryotic cell, such as a
drug resistance selection marker or selective marker based on nutrient dependency. A preferred drug resistance marker
is the gene whose expression results in neomycin resistance, i.e., the neomycin phosphotransferase (neo) gene.
Southern et al., 1982, J. Mol. Appl. Genet. 1: 327-341.

The vector is used to transform a suitable host, and the transformed host is cultured under favorable conditions to
effect the production of the recombinant B. stearothermophilus DNApoll by expression of the gene and subsequent
protein production in the compatible transformed host.

The synthesized B. stearothermophilus DNApoll is isolated from the medium or from the cells. Recovery and
purification of the protein may not be necessary in those instances where some impurities may be tolerated. 

Each of the foregoing steps can be done in a variety of ways. For example, the desired coding sequences may
be obtained from genomic fragments and used directly in appropriate hosts. The constructions for expression vectors
operable in a variety of hosts are made using appropriate replicons and control sequences, as set forth below. Suitable
restriction sites can, if not normally available, be added to the ends of the coding sequence so as to provide an excisable 
gene to insert into these vectors. 

The recombinant DNA molecules of the present invention are typically introduced into host cells via a procedure
commonly known as transformation or transfection. Transformation of appropriate host cells with a recombinant DNA
molecule of the present invention is accomplished by well known methods that typically depend on the type of vector
used (Cohen et al., 1972, Proc. Natl. Acad. Sci. USA 69: 2110; Maniatis et al., 1982, Molecular Cloning, A Laboratory
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y.; Rosenberg and Moss, 1990, Current Opinion
in Biotech. 1: 3-4; Sorge et al., 1984, Mol. Cell. Biol. 4: 1730-37; and Wigler et al., 1979, Proc. Natl. Acad. Sci. USA
76: 1373-76). Generally, prokaryotic, yeast, insect, mammalian or plant cells are presently useful as hosts.

Prokaryotic hosts are in general the most efficient and convenient for the production of recombinant proteins and
are therefore preferred for the expression of B. stearothermophilus DNApoll; they are represented by various strains
of E. coli. Given the teachings disclosed herein, further detailed in the Examples, one of ordinary skill in the art can
use standard recombinant DNA techniques to express B. stearothermophilus DNApoll or its derivatives in E. coli.
However, other microbial strains may also be used, such as bacilli, for example, Bacillus subtilis, various species of
Pseudomonas, or other bacterial strains. In such prokaryotic systems, plasmid vectors that contain replication sites
and control sequences derived from species compatible with the host are used. For example, E. coli is typically
transformed using derivatives of pBR322, a plasmid derived from an E. coli species by Bolivar et al., 1977, Gene 2: 95 and
Sutcliffe, 1978, Nuc. Acids Res. 5: 2721-28. pBR322 contains genes for ampicillin and tetracycline resistance, and
thus provides additional markers that can be either retained or destroyed in constructing the desired vector.

Commonly used prokaryotic control sequences are defined herein to include promoters for transcription initiation,
optionally with an operator, along with ribosome binding site sequences. Some examples of commonly used promoters
include the β-lactamase (penicillinase) and lactose (lac) promoter systems (Chang et al., 1977, Nature 198: 1056),
the tryptophan (trp) promoter system (Goeddel et al., 1980, Nucleic Acids Res. 8: 4057) and the lambda-derived PL
promoter (Shimatake et al., 1981, Nature 292: 28) and N-gene ribosome binding site (NRBS), which has been made useful as
a portable control cassette (as set forth in U.S. Pat. No. 4,711,845), which comprises a first DNA sequence that is the
PL promoter operably linked to a second DNA sequence corresponding to the NRBS upstream of a third DNA sequence
having at least one restriction site that permits cleavage within six bp 3' of the NRBS sequence.

Also useful is the phosphatase A (phoA) system described by Chang et al. in European Patent Publication No.
196,864. However, any available promoter system compatible with prokaryotes can be used. Typical bacterial plasmids
are pUC8, pUC9, pBR322 and pBR329 available from Bio-Rad Laboratories (Richmond, Calif.) and pTrc99A described
herein, pPL and pKK233-2, available from Pharmacia (Piscataway, N.J.) or Clontech (Palo Alto, Calif.), pBluescript
and pET (Stratagene, USA) and the like equivalent plasmid expression vectors. Preferred E. coli host cells include
RR1, XL1-Blue (Stratagene), and the like, in addition to the equivalent host cell described herein.

In addition to bacteria, eukaryotic microbes, such as yeast, may also be used as hosts. Laboratory strains of
Saccharomyces cerevisiae, Baker's yeast, are most used, although a number of other strains are commonly available.
While vectors employing the two micron origin of replication are illustrated (Broach, 1983, Meth. Enz. 101: 307), other
plasmid vectors suitable for yeast expression are known (see, for example, Brake et al., 1984, Proc. Natl. Acad. Sci.
USA 81: 4642-4647; Clarke et al., 1983, Meth. Enz. 101: 300; Halewell et al., 1987, Biotechnology 5: 363-366;
Stinchcomb et al., 1979, Nature 282: 39; Tschempe et al., 1980, Gene 10: 157). Control sequences for yeast vectors include
promoters for the synthesis of glycolytic enzymes (Hess et al., 1968, J. Adv. Enzyme Reg. 7: 149; Holland et al., 1978,
Biotechnology 17: 4900).

It is also possible to express genes encoding polypeptides in eukaryotic host cell cultures derived from multicellular
organisms. See, for example, Tissue Culture, Academic Press, Cruz and Patterson, editors (1973). Useful host cell
lines include murine myelomas M51, VERO and HeLa cells, Chinese hamster ovary (CHO) cells available from the
ATCC as CCL61, and NIH/3T3 mouse cells available from the ATCC as CRL1658. Expression vectors for such cells
ordinarily include promoters and control sequences compatible with mammalian cells such as, for example, the
commonly used early and late promoters from Simian Virus 40 (SV 40) (Fiers et al., 1978, Nature 273: 113), or other viral
promoters such as those derived from polyoma, adenovirus 2, bovine papilloma virus, or avian sarcoma viruses, or
immunoglobulin promoters and heat shock promoters.

A system for expressing DNA in mammalian systems using the HPV as a vector is disclosed in U.S. Pat. No. 
4,419,446. A modification of this system is described in U.S. Pat. No. 4,601,978. General aspects of mammalian cell 
host system transformations have been described in U.S. Pat. No. 4,399,216.

It now appears, also, that "enhancer" regions are important in optimizing expression. Enhancer regions are
sequences generally found upstream of the promoter region. Origins of replication may be obtained, if needed, from viral
sources. However, integration into the chromosome is a common mechanism for DNA replication in eukaryotes.

Plant cells are also available as hosts. Control sequences compatible with plant cells such as the nopaline synthase
promoter and polyadenylation signal sequences (Depicker et al., 1982, J. Mol. Appl. Gen. 1: 561) are also available.
See, also, U.S. Pat. No. 4,962,028, U.S. Pat. No. 4,956,282, U.S. Pat. No. 4,886,753 and U.S. Pat. No. 4,801,540. 

In addition, expression systems employing insect cells utilizing the control systems provided by baculovirus vectors 
have been described (Miller et al., 1986, In Genetic Engineering, Setlow, J. K. et al., eds., Plenum Publishing, Vol. 8,
pp. 277-297). See, also, U.S. Pat. No. 4,745,051 and U.S. Pat. No. 4,879,236.

Infection with Agrobacterium tumefaciens (Straw et al., 1983, Gene 23: 315) is used for certain plant cells. For
mammalian cells without cell walls, the calcium phosphate precipitation method of Graham and van der Eb, 1978,
Virology 52: 546 is preferred. Transformations into yeast are carried out according to the method of Van Solingen et
al., 1977, J. Bacteriol. 130: 946 and Hsiao, 1979, Proc. Natl. Acad. Sci. USA 76: 3829.

In addition to the transformed host cells themselves, cultures of the cells are contemplated as within the present
invention. The cultures include monoclonal (clonally homogeneous) cultures, or cultures derived from a monoclonal
culture, in a nutrient medium. Nutrient media useful for culturing transformed host cells are well known in the art and 
can be obtained from several commercial sources. In embodiments wherein the host cell is mammalian, a "serum- 
free" medium is preferably used. 

The present method entails culturing a nutrient medium containing host cells transformed with a recombinant DNA
molecule of the present invention that is capable of expressing a gene encoding a subject polypeptide. The culture is
maintained for a time period sufficient for the transformed cells to express the subject polypeptide. The expressed
polypeptide is then recovered from the culture. Recovery can be by techniques known in the art for isolating
polymerases (cf. Lawyer et al., 1989).

Standard techniques of protein purification may be employed to isolate and/or further purify the polypeptide
(polymerase), including: precipitation by taking advantage of the solubility at varying salt concentrations; precipitation
with organic solvents, polymers and other materials; affinity precipitation and selective denaturation; column
chromatography, including high performance liquid chromatography (HPLC), ion-exchange, affinity, immunoaffinity or
dye-ligand chromatography; immunoprecipitation; and the use of gel filtration, electrophoretic methods, ultrafiltration and
isoelectric focusing. Each of the above-identified methods is well within the knowledge of the skilled artisan, and no
undue experimentation is required to isolate and/or purify the polypeptides or polymerases after expression thereof by
a vector or a recombinant, using the standard methodologies outlined herein, and in the literature, as well as the
teachings in the Examples below.

Once a gene has been expressed in high levels, a DNA fragment containing the entire expression assembly (e.g.,
promoter, ribosome-binding site, and fusion protein gene) may be transferred to a plasmid that can attain very high
copy numbers. For instance, the temperature-inducible "runaway replication" vector pKN402 may be used. Preferably,
the plasmid selected will have additional cloning sites which allow one to score for insertion of the gene assembly.
See Bittner et al., 1981, Gene 15: 31. Bacterial cultures transformed with the plasmids are grown for a few hours to
increase plasmid copy number, e.g., to more than 1000 copies per cell. Induction may be performed in some cases
by elevated temperature and in other cases by addition of an inactivating agent to a repressor. Potentially very large
increases in cloned truncated B. stearothermophilus DNApoll can be obtained in this way.

As described herein, the clones designated PV8168, PV8169, PV8170, PV8171 and PV8172, produced using
oligonucleotides PK, P1, P2, P3 and P4 and a 3' primer labelled P5, were determined to retain DNA polymerase activity and
to possess essentially no exonuclease activity when compared to wild type Bst DNApoll. PV8168 has been shown to
retain 100% DNA polymerase activity.

Use of Deleted Clones in DNA Synthesis Reactions 

The deleted clones of this invention are useful in nucleic acid synthesis by primer extension reactions, particularly
DNA cycle-sequencing reactions. DNA polymerases are used to synthesize DNA molecules in the 5' to 3' direction
from deoxynucleotide triphosphates (nucleotides) using a complementary template DNA strand and a primer. The DNA
polymerase successively adds nucleotides to the free 3'-hydroxyl end of the growing strand.

In molecular biology, DNA polymerases used in DNA sequencing reactions generally use either 35S-, 32P- or 33P-labelled
nucleotides. The proof reading or editing function of the 3' to 5' exonuclease activity of many DNA polymerases
can interfere with processivity and is often disadvantageous in situations where one is trying to achieve net synthesis
of DNA. The novel deleted clones provided for in this invention, which have reduced 3' to 5' exonuclease activity or
lack it altogether, are therefore extremely useful for DNA synthesis reactions.

Bst DNA polymerase I is a superior alternative to traditional DNA sequencing enzymes. With its 65°C temperature
optimum, reactions can be performed at elevated temperatures to reduce secondary structure in the DNA and
accommodate stringent template/primer annealing conditions.

Some sequencing procedures currently involve first subcloning DNA fragments from a cosmid or bacteriophage
library into special sequencing vectors that carry shorter pieces of the original cosmid fragments. The next step is to
make the subcloned fragments into sets of nested fragments differing in length by one nucleotide, so that the specific
base at the end of each successive fragment is detectable after the fragments have been separated by gel
electrophoresis.

Sanger et al. (1977) introduced a sequencing method using chain-terminating dideoxynucleoside triphosphates
(ddNTPs). These ddNTPs differ from conventional dNTPs in that they lack a hydroxyl residue at the 3' position of
deoxyribose but can be incorporated by DNA polymerases into a growing DNA chain through their 5' triphosphate
groups. The absence of a 3' hydroxyl residue prevents formation of a phosphodiester bond with the succeeding dNTP.
Further extension of the growing DNA chain is therefore impossible. Thus, when a small amount of one ddNTP is
included with four conventional dNTPs in a reaction mixture for DNA synthesis, there is competition between extension
of the chain and infrequent, but specific, termination. By using four different ddNTPs in four separate enzymatic
reactions, populations of oligonucleotides are generated that terminate at positions occupied by every A, C, G, or T in
the template strand. The deleted B. stearothermophilus polymerase clones (lacking 3' to 5' exonuclease activity) are
especially suitable for such sequencing.
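
The chain-termination logic described above can be illustrated with a small sketch. It is a toy model only, not part of the disclosed method: the template string and the function names are hypothetical, and the model assumes that every possible termination product is present in each of the four reactions.

```python
# Toy model of dideoxy chain termination (illustrative assumptions only).
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def termination_lengths(template_3to5, primer_len=0):
    """Group extension-product lengths by the ddNTP that ends them.

    template_3to5 is the template written in the order the polymerase
    reads it, so the new strand is its complement written 5' -> 3'.
    """
    lengths = {"A": [], "C": [], "G": [], "T": []}
    for i, base in enumerate(template_3to5, start=1):
        incorporated = COMPLEMENT[base]
        # A product of this length exists in the reaction containing
        # the dideoxy form of the incorporated base.
        lengths[incorporated].append(primer_len + i)
    return lengths


def read_ladder(lengths):
    """Read the synthesized strand from the shortest to the longest product."""
    by_length = {n: base for base, ns in lengths.items() for n in ns}
    return "".join(by_length[n] for n in sorted(by_length))


template = "TACGGATC"  # hypothetical template, written 3' -> 5'
lanes = termination_lengths(template)
print(lanes)
print(read_ladder(lanes))
```

Sorting the products of the four reactions by length, as a gel does, recovers the sequence of the newly synthesized strand one base at a time.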

The Bst DNA polymerase is useful in both manual and automated radiolabeled DNA sequencing strategies.
Automated DNA sequencing protocols save laboratory time and costs while dramatically increasing power and
precision.

The B. stearothermophilus DNApoll has the thermostability required for automated sequencing combined with the
requisite precision. Several properties of this enzyme, including a high reaction temperature of 65°C, the ability to read
through sequences with high G/C content, and the production of even band intensities even in multiple nucleotide runs,
make Bst polymerase particularly appropriate for routine use in automated DNA sequencing techniques.

Use of Bst DNApoll and of Deleted Clones in DNA Labelling Reactions

The deleted clones described in this application are also useful in DNA labelling reactions. Many molecular
biological techniques require the use of DNA polymerases in DNA labelling reactions using either 35S-, 32P- or 33P-labelled
nucleotides.

DNA polymerase enzymes can be used for labelling of DNA by techniques such as nick translation. Double-stranded
DNA is treated with limiting amounts of DNAase I in the presence of magnesium ions. The resulting nicks serve as
primers for DNA synthesis catalyzed by DNA polymerase I. During synthesis, dNTP precursors are incorporated into the
growing chain of DNA and the nick is translated along the DNA in a 5' to 3' direction by virtue of the 5' to 3' exonuclease
activity carried by the enzyme. If radioactive precursors are supplied to the enzyme, the resulting product is radiolabeled
and can be used as a hybridization probe. Bst deleted clones do not necessarily have 5'-3' exonuclease activity and
therefore cannot necessarily be used in nick translation. However, full length clones of Bst DNApoll may have
5'-3' exonuclease activity and may be useful in nick translation. Further, some newer nick translation techniques may
not require 5' to 3' exonuclease activity and thus the technique may be useful with truncated clones of the invention.

The DNA polymerase can also be used to label the termini of DNA fragments by using 32P dNTPs to fill recessed
3' termini (end-labelling) and for end labelling of DNA molecules with protruding 3' tails. The 5' and 3' termini of DNA can
be radiolabeled enzymatically to yield molecules that may be used in nuclease-S1 mapping of RNA, as radiolabeled
primers in primer-extension reactions, and as markers of defined size in gel electrophoresis. To add distinct labels to
isolated DNA molecules, DNA is copied by the DNA polymerase enzyme in the presence of nucleotides that are either
radioactive or chemically tagged.

Synthetic oligonucleotides can also be labelled using the deleted clones. A short primer is hybridized to an
oligonucleotide template whose sequence is the complement of the desired radiolabeled probe. The primer is then extended
using the Klenow fragment to incorporate radiolabeled dNTPs in a template directed manner. After the reaction, the
template and product are separated by denaturation followed by electrophoresis through a polyacrylamide gel under
denaturing conditions. With this method, it is possible to generate oligonucleotide probes that contain several
radioactive atoms per molecule of oligonucleotide.

Similar to DNA sequencing reactions, these polymerases can also be used in automated DNA labelling protocols. 
The B. stearothermophilus DNAPoll has the thermostability required for automated labelling combined with precision. 
The high reaction temperature (65°C) of the truncated Bst polymerase enzymes, the ability of the enzymes to
read through sequences with high G/C content, and the production of even band intensities even in multiple nucleotide
runs make the deleted clones particularly useful in automated labelling reactions.

With respect to automated DNA sequencing or DNA labeling in which embodiments of the invention may be useful, 
reference is made to U.S. Patents Nos. 5,122,345; 5,674,743; 5,556,790; 5,302,509; 5,728,529; 5,707,804; 5,688,648; 
5,654,419; 5,614,386; 5,639,874; 5,674,716; 5,614,365; 5,543,026; and 5,543,018. As to automated DNA sequencing
or labelling with polymerase, in which embodiments of the invention may be useful, reference is made to U.S. Patents
Nos. 5,487,972; 5,660,989; 5,523,204; 5,604,098; 5,122,345; 5,635,347; 5,210,015; 5,728,529; 5,716,784; 5,573,907;
5,302,509; 5,599,675; 5,702,888; 5,108,892; 5,723,298; 5,674,743; 5,348,853; 5,552,278; 5,593,840; and 5,674,679;
see also WO 94/16107 relating to DNA sequencing with Bst polymerase.

Thus, it is believed that it is novel and nonobvious to use recombinant Bst polymerase or recombinant Klenow- 
like fragments from recombinant Bst polymerase or the isolated gene therefor, especially the advantageous truncated 
Bst polymerases (e.g., lacking or having reduced 3' to 5' exonuclease activity), as well as DNA therefor, in DNA se- 
quencing and labelling techniques, such as automated DNA sequencing and/or labelling techniques; for instance, as 
shown in the documents cited herein. And therefore, the invention comprehends the use of recombinant Bst polymerase
or recombinant Klenow-like fragments from recombinant Bst polymerase or the isolated gene therefor, especially the 
advantageous truncated Bst polymerases (lacking or having reduced 3' to 5' exonuclease activity) as well as DNA 
therefor, in DNA sequencing and labelling techniques. 

Preparation of Hybrid Proteins



There is a need for modular DNA polymerases with defined enzymatic domains which can be used in any combi- 
nation to create hybrid proteins with novel properties. The Bst DNApoll of this invention may be used in cloning and 
in conjunction with other clones to produce heterologous polypeptides from the cloned genes.

Enzymes may consist of a single catalytic domain or of a series of independent catalytic domains linked together 
by a peptide backbone. One need only preserve the sequence of amino acids which forms the catalytic domains for
the enzyme to function. Novel chimeric enzymes can be derived from linking together two or more catalytic domains, 
each retaining their enzymatic activity, to create a novel enzyme. 

To create these hybrid proteins with novel properties, one or more of these deleted clones or a part thereof can 
be combined with other clones. In attempting to create such a novel enzyme, it may be necessary to derive modular 
structures from different enzymes which retain enzymatic activity (catalytic domains) and link the separate catalytic
domains together to create a novel enzyme. The sequence of amino acids which forms the catalytic domain for each
enzyme must be preserved during the linkage. The linkage may be achieved by gene cloning. 

In the case of an enzyme with two or more catalytic domains, each of the catalytic domains must be maintained 
in a proper spatial relationship with respect to each other. A recombinant B. stearothermophilus DNApoll lacking
3'-to-5' exonuclease activity may thus be combined with other modular catalytic domains to further create other enzymes.
Thus, the B. stearothermophilus DNApoll of this invention may be used in cloning and in conjunction with other clones 
to produce heterologous polypeptides from the cloned genes. 

These chimeric or fusion enzymes, e.g., comprising two or more truncated Bst polymerase I enzymes (for instance 
from expression of DNA encoding such enzymes which was operably linked) are also useful in DNA sequencing or 
labelling, as are chimeric or fusion proteins comprising at least one truncated Bst polymerase I enzyme and a catalytic
domain. 

A better understanding of the present invention and of its many advantages will be had from the following non- 
limiting Examples, given by way of illustration. 

EXAMPLES

EXAMPLE 1. 

Bacterial growth and genomic DNA preparation 



B. stearothermophilus was grown at 60°C with shaking in L-broth (10 g tryptone (Difco), 5 g yeast extract, 5 g NaCl
per litre, pH 7.2) (Phang et al., 1995, Gene 163: 65-68). Bacterial cells were spun down at room temperature and
resuspended in 1.5 ml sucrose buffer (25% sucrose, 50 mM Tris pH 8.0). 150 µl lysozyme (10 mg/ml) was added and
the mixture was left at 20°C for 1 hr. A further 9 ml STET buffer (150 mM NaCl, 20 mM Tris pH 8.0, 1 mM EDTA pH
8.0), 1.5 ml of 5% SDS and 1.215 ml proteinase K (1 mg/ml) were added and the mixture was left at 37°C for 1 hr. 2.16 ml
of 5 M NaCl and 1.8 ml of a 0.7 M NaCl/10% CTAB solution were added and the mixture was extracted with chloroform/
isoamyl-alcohol and phenol/chloroform/isoamyl-alcohol. The DNA was spooled from 0.6 vol isopropanol. The DNA
was washed in 70% ethanol and air-dried. The pellet was redissolved in TE buffer (20 mM Tris pH 8.0, 1 mM EDTA
pH 8.0).

EXAMPLE 2.

Plasmid purification 

Recombinant E. coli clones were grown at 37°C with shaking in L-broth with 200 µg/ml ampicillin. The transformed
cells were cultured in 4 ml of L-broth overnight. 3 ml of the culture was pelleted in a microcentrifuge tube. The pellet
was resuspended in 100 µl of GTE (50 mM glucose; 25 mM Tris-HCl, pH 8.0; 10 mM EDTA, pH 8.0) by vigorous vortex
mixing. 200 µl of lysis buffer (0.2 M NaOH; 1% sodium dodecyl sulfate) was added immediately and the tube was
inverted and mixed. 150 µl of 5 M potassium acetate (prepared by mixing 60 ml 5 M potassium acetate, 11.5 ml glacial
acetic acid and 28.5 ml H2O) was added and mixed quickly. The mixture was centrifuged for 1 min at 14000 rpm. The
supernatant was transferred to a fresh tube and 1 ml of 95% ethanol was added. The mixture was centrifuged for 1
min at 14000 rpm. The supernatant was removed and the pellet was dried down. The plasmid DNA was resuspended
in 200 µl of TE buffer (10 mM Tris-HCl, 1 mM EDTA, pH 7.4) containing 20 µg/ml RNase A.
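
The composition of the potassium acetate solution described above can be checked with a short calculation. This is a sketch under stated assumptions: the volumes are taken from the text, the molarity of glacial acetic acid (about 17.4 M) is an assumed textbook value not given in the source, and volumes are treated as additive. The function name is hypothetical.

```python
# Assumed value, not from the source: glacial acetic acid is roughly 17.4 M.
GLACIAL_ACETIC_ACID_M = 17.4


def potassium_acetate_solution(koac_ml=60.0, koac_molar=5.0,
                               acetic_ml=11.5, water_ml=28.5):
    """Return total volume and approximate potassium/acetate molarities."""
    total_ml = koac_ml + acetic_ml + water_ml  # assumes additive volumes
    potassium = koac_ml * koac_molar / total_ml
    acetic = acetic_ml * GLACIAL_ACETIC_ACID_M / total_ml
    return {
        "total_ml": total_ml,
        "potassium_M": potassium,
        # Acetate comes from both the salt and the added acid.
        "acetate_M": potassium + acetic,
    }


print(potassium_acetate_solution())
```

Under these assumptions the mixture is about 3 M with respect to potassium and about 5 M with respect to acetate, which is consistent with the "5 M potassium acetate" label used in the protocol.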

EXAMPLE 3.

Homology studies to define the approximate location and size of the DNA synthesis domain

The amino acid sequence of the 3'-5' exonuclease domain of E. coli DNApoll was used to locate a similar domain
for Bacillus subtilis DNApoll (Barnes et al., 1992, Gene 111: 43-49). Similarly, the 3'-5' exonuclease domain of the B.
stearothermophilus DNApoll can be located using E. coli DNApoll as the basis for comparison (Ito and Braithwaite,
1991, Nucleic Acids Res. 19: 4045-4057). This was done by using a computer program (DNASIS V2.0) to compare
the amino acid sequences. There is a 42% match between the two sequences. Sequence homology in B. stearothermophilus
DNApoll appears to begin around amino acid (aa) residue 436 and to end at amino acid residue 954. This
gives a Klenow-like fragment for B. stearothermophilus DNApoll of about 518 amino acids, which is much shorter than
the 600 amino acids expected for this family (Joyce, 1991, Current Opinion in Structural Biol. 1: 123-129).
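
The two quantities used in this comparison, a percent match between aligned sequences and the length of the homologous span, can be sketched as follows. This is an illustration only: how DNASIS V2.0 scores a match is not stated in the source, so the identity measure below (identical residues over ungapped aligned positions) is an assumption, and the short aligned strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of ungapped aligned positions carrying identical residues."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # skip positions where either sequence has a gap
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


def span_length(first_residue, last_residue):
    """Length of a span counted the way the text does (difference of ends)."""
    return last_residue - first_residue


# Hypothetical toy alignment, not real polymerase sequence.
print(round(percent_identity("MKV-LAGT", "MRVQLA-T"), 1))
# Residues 436 to 954 from the text give the quoted fragment size.
print(span_length(436, 954))
```

The second call reproduces the figure of about 518 amino acids quoted above for the Klenow-like fragment.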

By comparing the peptide sequences of various DNA-dependent polymerases, three conserved segments termed 
Exo I, Exo II, and Exo III, have been proposed to be located in the 3'-5' exonuclease domain and, thus define the extent 
of the 3'-5' exonuclease domain to consist of about 207 amino acids extending from residue 350 to residue 557 in E. coli
DNApoll (Blanco et al., 1992, Gene 112: 139-144; Reha-Krantz, 1992, Gene 112: 133-137). Yet another reference
(Steitz and Joyce, 1987, In Protein Engineering, Chapt. 20, pp. 227-235. Oxender, D.A., and Fox, C.F. (eds). Alan R.
Liss, New York), drawing its conclusion from X-ray crystallography, has indicated that the exonuclease domain of the
E. coli DNApollK fragment is expected to be about 193 amino acids in length, extending from aa residue 324 to 517.

Thus, there are uncertainties as to the exact size and siting of the various domains. In view of this, a series of
primers was designed to clone a series of 5' deleted B. stearothermophilus DNApoll clones using polymerase chain
reaction (PCR). This series of 5' deleted clones will progressively define the DNA synthesis domain.

EXAMPLE 4. 

Designing of primers and production of the deleted genes

Primers were from Genosys, USA. All restriction enzymes were from New England Biolabs, USA. T4 DNA ligase
was from Promega, USA. Taq polymerase was from Promega, USA, and the radioactive nucleotides were from NEN- 
Dupont, USA. 

The main concern of primer design here is to maintain the correct reading frame (Innis et al., 1990, "PCR Protocols:
A Guide to Methods and Applications", Academic Press, San Diego). In this case, either of the vectors exemplified
here, pTrc99A or pET11d, has an engineered NcoI restriction site in frame for translation starting with the ATG codon
which has been aligned with the E. coli ribosome binding site. Thus, cloning of the 5' end of the gene into this NcoI
restriction site would immediately align it in the right reading frame for translation to occur.

As the 5' end of the B. stearothermophilus DNApoll gene is to be deleted in a sequential series, the 5' end primers
must be designed to introduce an NcoI restriction site as well as to prime to the designated sites. After DNA amplification,
the amplified clones are digested with NcoI restriction enzyme to create sticky ends suitable for cloning into the vector.
The 5' end primers are designated PK, P1, P2, P3 and P4 and their priming sites are indicated in Fig. 1.
The 5' end primers are:

P2: 5'-CAT G CC /ATG/ G CG GCG GCG ATT TGG-3'

P3: 5'-CAT G CC /ATG/ G CG CCA AAA CAG CTC GGG-3'

P4: 5'-CAT G CC /ATG/ G GC CGC CTC AGC TC-3'

PK: 5'-CAT G CC /ATG/ G CC GTC CAA-3'

(SEQ ID NOS: 13, 14, 15, 16 and 17)

The sequence which has been underlined is the NcoI restriction site and the boxed and bold sequence (/ATG/) is the
translation start. Some bases have been altered, as silent mutations, to enhance hybridization while maintaining the
peptide sequence. Translation initiation has been known to be affected by the formation of base-paired secondary
structures. It has been possible to correlate low or no protein synthesis with proposed mRNA secondary structures
that involve either the initiating AUG codon or the Shine-Dalgarno sequence (Buell et al., 1985; Iserentant and Fiers,
1980; Reznikoff and Gold, 1986), and this has been clearly shown by the disruption of the mRNA secondary structures
around the AUG start site for effective translation (Devlin et al., 1988; Gheysen et al., 1982; Hall et al., 1982; Schoner
et al., 1984). The computer program (DNASIS V2.0) was also used to predict possible mRNA 5' secondary structures
of the designed primers to avoid mistranslation.
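
The design rule stated above, that each 5' end primer carries an NcoI site whose internal ATG sets the reading frame, can be checked mechanically. The sketch below is illustrative only: the primer strings are transcribed from the listing in this Example (P1 is not included), and the helper names `clean` and `check_primer` are hypothetical.

```python
import re

NCOI_SITE = "CCATGG"  # NcoI recognition sequence

# Primer sequences as listed in this Example (spacing and slashes kept).
PRIMERS = {
    "P2": "CAT G CC /ATG/ G CG GCG GCG ATT TGG",
    "P3": "CAT G CC /ATG/ G CG CCA AAA CAG CTC GGG",
    "P4": "CAT G CC /ATG/ G GC CGC CTC AGC TC",
    "PK": "CAT G CC /ATG/ G CC GTC CAA",
}


def clean(seq):
    """Strip spacing and markup, leaving only the bases."""
    return re.sub(r"[^ACGT]", "", seq.upper())


def check_primer(seq):
    """Locate the NcoI site and list codons read from its internal ATG."""
    bases = clean(seq)
    site = bases.find(NCOI_SITE)
    if site < 0:
        return None
    start = site + 2  # the ATG sits two bases into CCATGG
    codons = [bases[i:i + 3] for i in range(start, len(bases) - 2, 3)]
    return {"site_index": site, "start_index": start, "codons": codons}


for name, seq in PRIMERS.items():
    print(name, check_primer(seq))
```

Each listed primer places the NcoI site at the same offset, so the ATG, and therefore the reading frame, is positioned identically in every construct.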

Thus, using computer models of various previously cloned genes, we found good correlation of protein synthesis 
to the predicted 5' end mRNA secondary structures. In general, thermodynamic values of about -2.0 kcal/mol or greater 
appeared to be favorable for protein synthesis. All the 5' end primers were designed based on these rationalizations.

The primer for the 3' end of the gene is the same for all the deleted clones. It may be any priming site located in
a 3' position to the internal NcoI site. In this example, the 3' end primer was located on the pTrc99A vector itself within
the multicloning region (see Fig. 2). A large 3' end fragment of the B. stearothermophilus DNApoll gene had been
cloned into this vector as described in Phang et al. (1995). This was termed PV8168. Thus DNA amplification was
performed directly on this plasmid clone.
The 3' end primer is:

P5: 5'-GCTTGCATGCCTGCAGGTCG-3' (SEQ ID NO: 11)



EXAMPLE 5.

Cloning of 5' deleted B. stearothermophilus DNApoll genes 

In vitro DNA manipulations were done by standard procedures (Berger, S.L. et al., 1987, "Guide to Molecular
Cloning Techniques", Meth. Enz. 152: 393-399, 415-423, 432-449, 661-704; Sambrook et al., 1989, In Molecular
Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbour Press, New York). Restriction enzymes were from New England
Biolabs, USA, and T4 DNA ligase was from Promega, USA. In this example, pTrc99A vector (Pharmacia Biotech,
Sweden) will be used to describe a general scheme for cloning of all the 5' deleted genes. Placement of the B.
stearothermophilus DNApoll gene under control of a trc promoter took advantage of the NcoI site covering the ATG
translational start site of the pTrc99A vector (Phang et al., 1995).

The expression vector pTrc99A contains a strong trc promoter that, when induced, will result in the overexpression 
of the protein. This promoter is controlled by the lacIq repressor. Thus glucose must be added to the culture media to
repress the basal level of expression in the uninduced state. Induction occurs with the addition of 1 mM IPTG (Amann
et al., 1988, Gene 69: 301-315).

However, as there is an internal NcoI restriction site in the B. stearothermophilus DNApoll gene (Phang, 1995), it
was necessary to insert the gene into the pTrc99A vector in two stages.

The first stage involves cloning the 3' end into the vector, and this construct may be used to clone all the 5' deleted
clones in the second stage (Fig. 3). A 3' end fragment of the B. stearothermophilus DNApolI gene spanning the internal
NcoI restriction site was digested with NcoI restriction enzyme and directionally cloned into the pTrc99A vector
predigested with NcoI + SmaI restriction enzymes. This heterologous construct was then digested with NcoI to accept the
5' deleted primers. The second stage involves the cloning of the various 5' end fragments into the first stage construct.
The 5' deleted B. stearothermophilus DNApolI genes spanning the 5' end priming sites to the end of the gene were
amplified and digested with NcoI. The 5' end fragments were ligated to the first stage construct. Orientation was checked
by either digesting the recombinant with HindIII or DNA sequencing. The correct deleted clone should result in
overexpression and synthesis of the correct truncated proteins.
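
The need for a two-stage scheme follows from the presence of more than one NcoI site in the insert. As a minimal sketch, the check can be expressed as a search for the NcoI recognition sequence (CCATGG, which contains an ATG). The sequence `toy_gene` below is a made-up placeholder and is not SEQ ID NO: 1; the function name `find_sites` is likewise hypothetical.

```python
# Illustrative sketch only: toy_gene is an invented placeholder sequence,
# not the B. stearothermophilus gene of SEQ ID NO: 1.
NCOI_SITE = "CCATGG"  # NcoI recognition sequence; note the embedded ATG


def find_sites(seq, site=NCOI_SITE):
    """Return the 0-based start position of every occurrence of `site` in `seq`."""
    seq = seq.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


toy_gene = "CCATGGCTAAAGGTGAACTGCCATGGTTCGATTAA"
sites = find_sites(toy_gene)

# One site at the start codon plus at least one internal site means that a
# single NcoI digest would cut the insert, so staged cloning is required.
needs_two_stage = len(sites) > 1
print(sites, needs_two_stage)
```

With a real sequence, any hit other than the one at the translational start would indicate an internal site of the kind described above.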



IPTG induction of recombinant clones

Clones were grown at 37°C overnight in L-broth with 1% glucose and 200 µg/ml ampicillin. The following day,
a 4 ml aliquot of culture was seeded into 200 ml of L-broth with 200 µg/ml ampicillin and grown at 37°C until an OD600
of 0.8. The culture was induced with IPTG (1 mM final concentration) and further grown at 30°C for 3 hours.

As we are attempting to eliminate the various possibilities for the lack of protein production, we need to abolish
basal level synthesis of the protein in the control samples. It has been previously shown (Silverstone et al., 1970, Proc.
Natl. Acad. Sci. USA 66: 773-778) that the lacUV5 promoter is insensitive to glucose repression of up to 0.4%; thus
basal levels of B. stearothermophilus DNApolI were also obtained in our controls (0.1% glucose). An empirical assay
was performed on overnight cultures of recombinants grown in various glucose concentrations. A 1% glucose
concentration was used in subsequent experiments.

Clones were grown at 37°C overnight in LB medium with 1% glucose to repress constitutive expression from
the trp/lac promoter, and 200 µg/ml ampicillin was added as a precaution. The following day, a 4 ml aliquot was seeded
into 200 ml of LB with ampicillin and grown until A600 nm = 0.8. IPTG was added to a final concentration of 1 mM and
the culture was induced at 30°C for 3 hrs. This was spun down and resuspended in 40 ml of 10 mM Tris pH 7.5. The pellet
was stored overnight at -20°C. Overnight samples were pelleted and resuspended in 10 ml of 10 mM Tris pH 7.5. This was
sonicated at 8-10 W with a Vibra-cell (Sonics & Materials, Connecticut, USA), 10x each round for 1 min. The mixture was
incubated at 60°C for 15 mins to denature the E. coli proteins and centrifuged at 12500 rpm for 10 mins to collect a
clear supernatant (Engelke, 1990; Lawyer, 1989; Phang, 1995).
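
The seeding ratio and the IPTG addition in the induction procedure are simple dilution arithmetic. The sketch below works them through; the 1 M IPTG stock concentration is an assumption made for illustration (the text states only the 1 mM final concentration), and both function names are hypothetical.

```python
def seeding_ratio(inoculum_ml, medium_ml):
    """Nominal inoculum-to-medium ratio, e.g. 4 ml into 200 ml gives 50 (1:50)."""
    return medium_ml / inoculum_ml


def stock_volume_ml(culture_ml, final_mm, stock_mm):
    """Volume of stock needed so the culture reaches `final_mm` (C1*V1 = C2*V2).

    The small volume added by the stock itself is neglected.
    """
    return culture_ml * final_mm / stock_mm


# 4 ml overnight culture seeded into 200 ml of medium, as in the text.
ratio = seeding_ratio(4.0, 200.0)

# Assumed 1 M (1000 mM) IPTG stock; the stock strength is not given in the text.
iptg_ml = stock_volume_ml(200.0, 1.0, 1000.0)

print(f"seeding ratio 1:{ratio:.0f}, IPTG stock to add: {iptg_ml * 1000:.0f} µl")
```

A different stock concentration only changes the last argument; the nominal 1:50 ratio matches the seeding ratio stated for the large scale culture in EXAMPLE 10.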



EXAMPLE 7. 

SDS-PAGE gel electrophoresis 

The molecular weight of the dialyzed product may be determined by any technique, for example, by sodium
dodecylsulfate-polyacrylamide gel electrophoresis (SDS-PAGE) using protein molecular weight markers. The series of
truncated B. stearothermophilus DNApolI enzymes that are deficient in 3'-5' exonuclease activity purified by the above
method in EXAMPLE 5 have relative molecular weights, determined by SDS-PAGE under denaturing conditions, of
about 65.6, 56.1, 47.1, 38.4, and 29.2 kDa.
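
As a rough plausibility check, the five molecular weights can be compared with the five truncation points given in the numbered paragraphs (enzymes beginning at amino acids 286, 371, 453, 532 and 616 of SEQ ID NO: 2). The sketch below assumes that the largest protein corresponds to the shortest deletion and so on in order; that pairing is an assumption and is not stated explicitly in the text.

```python
# First retained amino acid of each truncated enzyme (numbered paragraphs 19-27)
FIRST_RESIDUE = [286, 371, 453, 532, 616]
# Apparent molecular weights from SDS-PAGE, in kDa (assumed to pair in order)
MW_KDA = [65.6, 56.1, 47.1, 38.4, 29.2]


def daltons_per_residue(first_residue, mw_kda):
    """Mass lost per additional residue deleted, for each consecutive pair."""
    results = []
    for i in range(len(mw_kda) - 1):
        residues_removed = first_residue[i + 1] - first_residue[i]
        mass_lost_da = (mw_kda[i] - mw_kda[i + 1]) * 1000.0
        results.append(mass_lost_da / residues_removed)
    return results


per_residue = daltons_per_residue(FIRST_RESIDUE, MW_KDA)
for value in per_residue:
    print(f"{value:.1f} Da per residue")
```

Under that pairing, each step works out to roughly 110 Da per residue, which is in line with the commonly used average mass of an amino acid residue in a protein.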

Samples were analyzed with the Bio-Rad Mini-PROTEAN II apparatus to check for protein synthesis (Hames and
Rickwood, 1989, In Gel Electrophoresis of Proteins: A Practical Approach, IRL, Oxford). 1 ml of cell sample was pelleted
and resuspended in 100 µl Laemmli buffer (0.05 M DTT, 0.125 M Tris pH 6.8, 50% glycerol (v/v), 0.05% bromophenol
blue (w/v), 1% SDS). The mixture was boiled for 5 min and centrifuged at 14000 rpm for 5 min. 20 µl of the supernatant
was loaded onto 12% acrylamide gels with a 5% stacking gel and electrophoresed at 120 V for 60-90 min, then at
constant current 20 mA for 1 hour in Tris/glycine/SDS buffer (25 mM Tris pH 8.3, 192 mM glycine, 0.1% SDS). Gels
were stained in Coomassie Blue (0.1% Coomassie Blue R-250 (w/v), 10% acetic acid, 45% methanol, 45% H2O) for
2 hr, then destained (10% acetic acid, 45% methanol, 45% H2O) and dried down in a vacuum at 60°C for 2 hrs between
two pieces of wet cellophane.

EXAMPLE 8.

Assay for B. stearothermophilus DNApolI Activity

The collected fractions were separately assayed for DNA polymerase I activity (Setlow et al., 1972, J. Biol. Chem.
247: 224-231; Setlow and Kornberg, 1972, J. Biol. Chem. 247: 232-240). The reaction mix contained 20 mM Tris pH
8.5, 20 mM MgCl2, 1 mM each dCTP, dGTP, dTTP, 1.25 µg/µl activated DNA (Pharmacia), 1000-15000 Ci/mM [35S]-
dATP, 1-10 units of enzyme, and water to 0.3 ml. The assay was initiated by the addition of enzyme, and the mixtures
were incubated for 5 mins at 65°C. The reaction was stopped by addition of 1 ml chilled 10% TCA/50 mM sodium
pyrophosphate. After 10 minutes at 4°C, the solution was filtered through a glass fibre filter (Whatman GF/C, 2.4 cm),
washed 5 times with 5 ml chilled 10% TCA/50 mM sodium pyrophosphate, and then 3 times with 5 ml ethanol. The
filters were dried under suction for two minutes and then counted in a scintillation counter. An incubation without enzyme
was included in each set of assays. (One unit of polymerase activity catalyzes the incorporation of 10 nmol total nucleotides
into a DEAE-precipitable form in 30 minutes at 72°C.) As described herein, the clones designated PV8168, PV8169,
PV8170, PV8171 and PV8172 were determined to retain their DNA polymerase activity when compared to wild type
B. stearothermophilus DNApolI. Indeed, PV8168 retained 100% of its DNA polymerase activity.
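
The unit definition above can be turned into a short calculation. The following is only a sketch under stated assumptions: incorporation is taken to be linear with time, the temperature difference between the 65°C assay and the 72°C unit definition is ignored, and `cpm_per_nmol` (counts per nmol of total nucleotide incorporated) is a hypothetical calibration value that would have to be determined for the actual label mix. The example counts are invented.

```python
def polymerase_units(sample_cpm, blank_cpm, cpm_per_nmol, assay_minutes,
                     nmol_per_unit=10.0, unit_minutes=30.0):
    """Estimate units of polymerase from filter-bound counts.

    One unit = 10 nmol total nucleotide incorporated in 30 minutes.
    The no-enzyme blank is subtracted, and incorporation is scaled
    linearly from the assay time to the 30 minute reference time.
    """
    net_cpm = max(sample_cpm - blank_cpm, 0.0)
    nmol_incorporated = net_cpm / cpm_per_nmol
    return (nmol_incorporated / nmol_per_unit) * (unit_minutes / assay_minutes)


# Invented example values: 52,000 cpm sample, 2,000 cpm no-enzyme blank,
# hypothetical calibration of 3,000 cpm per nmol, 5 minute incubation.
units = polymerase_units(52000.0, 2000.0, 3000.0, 5.0)
print(f"{units:.1f} units in the assay")
```

Subtracting the blank corresponds to the incubation without enzyme that is included in each set of assays.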

EXAMPLE 9. 

DNA Sequencing 

DNA Sequencing Reactions: 

1. Mix the following in a 1.5 ml microcentrifuge tube:

   5x Reaction Buffer    2.0 µl
   primer                1.0 µl      (2.5-5.0 ng)
   ssDNA template                    (250-500 ng)
   dH2O                  to 10 µl





2. Place in a 75°C water bath for 5 minutes, then cool slowly to room temperature (about 5 minutes). Note: this 
step is optional for single-strand templates only, and it can be omitted. 

3. During the annealing, label 4 tubes "A", "C", "G", and "T"; add 2.0 µl of each pre-mixed nucleotide solution to
the respective tube, and pre-warm to 65°C.

4. Add to the annealed reaction (from step 2): 



   Bst DNA polymerase    1.0 µl      (1.0 unit)
   [α-35S]dATP           1.5 µl



Mix gently, spin 2-3 seconds in a microcentrifuge, and aliquot 2.5 µl to each of the tubes (from step 3).

5. Incubate at 65°C for 2 minutes.

6. Dilute 1.0 µl of 20x Chase Solution into 19 µl of water; 8.0 µl of diluted Chase is needed per reaction set.

7. Add 2.0 µl of diluted Chase to each tube, mix gently, and incubate at 65°C for 2 minutes.

8. Stop the reactions by adding 4.0 µl of Stop Solution.

9. Denature the samples at >75°C for 1-2 minutes immediately prior to loading.
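
The volumes in steps 1 to 8 can be checked with a little bookkeeping. The sketch below uses only the volumes stated in the protocol; the variable names are illustrative.

```python
# All volumes in microlitres, taken from steps 1-8 of the protocol above.
ANNEALING_MIX_UL = 10.0     # step 1: brought to 10 µl with dH2O
ENZYME_UL = 1.0             # step 4: Bst DNA polymerase
LABEL_UL = 1.5              # step 4: labelled dATP
ALIQUOT_UL = 2.5            # step 4: aliquot per termination tube
NUCLEOTIDE_MIX_UL = 2.0     # step 3: pre-mixed nucleotide solution per tube
CHASE_PER_TUBE_UL = 2.0     # step 7
STOP_UL = 4.0               # step 8
TUBES = 4                   # A, C, G and T

labelling_mix_ul = ANNEALING_MIX_UL + ENZYME_UL + LABEL_UL
leftover_ul = labelling_mix_ul - TUBES * ALIQUOT_UL

# Step 6: 1 µl of 20x Chase into 19 µl water gives 20 µl at 1x.
chase_prepared_ul = 1.0 + 19.0
chase_dilution_fold = chase_prepared_ul / 1.0
chase_needed_ul = TUBES * CHASE_PER_TUBE_UL

final_tube_ul = NUCLEOTIDE_MIX_UL + ALIQUOT_UL + CHASE_PER_TUBE_UL + STOP_UL

print(f"labelling mix: {labelling_mix_ul} µl, leftover after aliquots: {leftover_ul} µl")
print(f"chase prepared: {chase_prepared_ul} µl, needed: {chase_needed_ul} µl")
print(f"final volume per tube: {final_tube_ul} µl")
```

The 8.0 µl of diluted Chase quoted in step 6 is the four 2.0 µl additions of step 7, and the 12.5 µl labelling mix covers the four 2.5 µl aliquots with a small excess.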

DOUBLE STRAND TEMPLATE (PLASMID) SEQUENCING PROTOCOL 

1. Mix the following in a 1.5 ml microcentrifuge tube: 



   dsDNA template             10 µl     (1-3 µg)
   0.4 N NaOH, 0.4 mM EDTA    10 µl



2. Incubate at 75°C for 5 minutes. 

3. Add 2.0 µl of 3 M ammonium acetate, pH 5.2, and 50 µl of cold ethanol. Place tube at -80°C for 15 minutes.

4. Spin in a microcentrifuge at 12,000 rpm, 4°C for 5 minutes.

5. Wash pellet with 200 µl of cold 70% ethanol. Spin 2-3 minutes and remove the supernatant. Note: this step is
optional and may be omitted.

6. Dry pellet under vacuum for 3-5 minutes, then resuspend in 7.0 µl TE.

7. Proceed with annealing and sequencing as described above. 

Polyacrylamide sequencing gels were prepared (6% gel having 5.7% acrylamide, 0.3% bis-acrylamide/7 M urea/
1X TBE) and preheated to 50°C, using a power supply regulated to 120 watts (W). The denatured sample (2 µl) was loaded
onto the gel and electrophoresed at 55°C until the bromophenol blue dye front migrated to the bottom of the gel. The
gel assembly was taken apart, and the gel was transferred to Whatman 3 MM paper, dried and exposed to X-ray film for
12-20 hrs.

EXAMPLE 10.

Large scale production and purification of the recombinant truncated B. stearothermophilus DNApolI proteins

An overnight culture of E. coli clones in sterile LB containing 100 µg/ml ampicillin and 34 µg/ml chloramphenicol
was seeded 1:50 in a 1 L Belco culture flask containing 100 µg/ml ampicillin. This was shaken at 37°C, 250 rpm until
OD600 reached 0.6. This was induced with 1 mM IPTG and shaken for another 3 hrs at 30°C. The sample was spun
at 8000 rpm, 4°C for 5 minutes. The supernatant was discarded and the pellet was stored at -20°C. The pellet was
resuspended in 25-30 ml of 10 mM Tris pH 7.5 per Falcon tube and 0.02 mM PMSF was added to the cell suspension.
This was sonicated at 8-10 W with a Vibra-cell (Sonics & Materials, Connecticut, USA), 10x each round for 1 min.
The sonicated cell suspension was incubated at 60°C for 15 min. The sample was centrifuged in a JA-20 rotor (Beckman,
USA) at 13000 rpm for 30 min at 4°C. The clear supernatant was loaded onto an ion-exchange DE-52 chromatographic
column using 10 mM Tris pH 7.5 as buffer and eluted in a gradient (0-0.3 M NaCl) over 80 fractions (6 ml each). Purified
fractions were checked on SDS-PAGE for protein concentrations (Deutscher, M P, 1990, Meth. Enz. 182: 738-751).
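
To relate a fraction number to the approximate salt concentration at which a protein elutes, the gradient can be modelled as below. This is a sketch that assumes the gradient is linear and ignores column dead volume and mixing; the text states only the end points (0 to 0.3 M NaCl), the number of fractions (80) and the fraction size (6 ml).

```python
N_FRACTIONS = 80
FRACTION_ML = 6.0
START_M = 0.0
END_M = 0.3


def nacl_molar(fraction, n_fractions=N_FRACTIONS, start=START_M, end=END_M):
    """NaCl concentration (M) at the end of fraction `fraction` (1-based),
    assuming a linear gradient with no dead volume."""
    if not 0 <= fraction <= n_fractions:
        raise ValueError("fraction number outside the gradient")
    return start + (end - start) * fraction / n_fractions


total_gradient_ml = N_FRACTIONS * FRACTION_ML
print(f"total gradient volume: {total_gradient_ml:.0f} ml")
print(f"fraction 40 ends at about {nacl_molar(40):.2f} M NaCl")
```

In practice the delay between the gradient former and the column outlet shifts these values, so the figures are indicative only.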

Typically, the purified protein is dialyzed after purification against a low salt buffer, e.g., 10 mM Tris pH 7.5, 1 mM
dithiothreitol. Typically, a stabilizing agent, such as glycerol, is added to the preparation to facilitate low temperature
storage of the purified, recombinant enzyme.

In preferred embodiments, B. stearothermophilus DNApolI is used in combination with a thermostable buffer such
as 20 mM Tris pH 8.5, 20 mM MgCl2, 1 mM each dCTP, dGTP, dTTP, 1.25 µg/µl activated DNA (Pharmacia),
1000-15000 Ci/mM [35S]-dATP. Thus, the invention also contemplates compositions containing an enzymatically active
amount of a Bst DNA polymerase in a compatible buffer.

Recombinant B. stearothermophilus DNApolI proteins are typically produced by recombinant DNA
techniques from a gene encoding the modified enzyme. The precise nucleotide sequence is not critical as long as the
resulting encoded protein has the requisite properties.

The DNA segment can then be operatively linked to a vector to create a recombinant DNA molecule using well
known techniques. The nucleotide sequence is joined to the vector so that the sequence is under the transcriptional
and translational control of the expression vector. The vector is then used to transform a suitable host and the
transformed host is cultured under favorable conditions to effect the production of the recombinant B. stearothermophilus
DNApolI by expression of the gene and subsequent protein production in the compatible transformed host. The details
of the recombinant techniques and the transformation of cells are provided above and use techniques that are well
known to the skilled artisan.

The invention shall now be described by way of the following numbered paragraphs: 

1. An isolated nucleic acid molecule encoding a Bacillus stearothermophilus DNA polymerase I enzyme having
reduced 3' to 5' exonuclease activity.

2. The isolated nucleic acid molecule of paragraph 1 comprising a Bacillus stearothermophilus DNA polymerase 
I gene having deletions, inactivating insertions, or mutations at the 5' end of the molecule. 

3. A Bacillus stearothermophilus DNA polymerase I enzyme encoded by the isolated DNA molecule of paragraph 1.

4. The enzyme of paragraph 3, wherein the enzyme has DNA synthesis activity but is deficient in 3' to 5' exonuclease
activity.

5. A Bacillus stearothermophilus DNA polymerase I which has DNA synthesis activity but is deficient in 3' to 5'
exonuclease activity.

6. An isolated DNA molecule comprising a nucleotide sequence as in Figure 1 (SEQ ID NO: 1) having deletions
from the 5' end of the molecule such that the DNA molecule encodes an enzyme which retains DNA synthesis
activity but is deficient in 3' to 5' exonuclease activity.

7. A Bacillus stearothermophilus DNA polymerase I enzyme comprising the amino acid sequence as in Figure 1 
(SEQ ID NO: 2) which is truncated such that the enzyme retains DNA synthesis activity but exhibits reduced 3' to 
5' exonuclease activity. 

8. The isolated DNA molecule of paragraph 6, wherein nucleotides 1 to 1077 of Figure 1 (SEQ ID NO: 1) are deleted.

9. The isolated DNA molecule of paragraph 6, which begins at nucleotide 1078 of Figure 1 (SEQ ID NO: 1).

10. The isolated DNA molecule of paragraph 6, wherein nucleotides 1 to 1332 of Figure 1 (SEQ ID NO: 1) are 
deleted. 

11. The isolated DNA molecule of paragraph 6, which begins at nucleotide 1333 of Figure 1 (SEQ ID NO: 1).

12. The isolated DNA molecule of paragraph 6, wherein nucleotides 1 to 1577 of Figure 1 (SEQ ID NO: 1) are
deleted.

13. The isolated DNA molecule of paragraph 6, which begins at nucleotide 1578 of Figure 1 (SEQ ID NO: 1).

14. The isolated DNA molecule of paragraph 6, wherein nucleotides 1 to 1815 of Figure 1 (SEQ ID NO: 1) are
deleted.

15. The isolated DNA molecule of paragraph 6, which begins at nucleotide 1816 of Figure 1 (SEQ ID NO: 1).

16. The isolated DNA molecule of paragraph 6, wherein nucleotides 1 to 2067 of Figure 1 (SEQ ID NO: 1) are
deleted.

17. The isolated DNA molecule of paragraph 6, which begins at nucleotide 2068 of Figure 1 (SEQ ID NO: 1).

18. The DNA polymerase enzyme of paragraph 7, wherein amino acids 1 to 285 of Figure 1 (SEQ ID NO: 2) are 
deleted. 

19. The DNA polymerase enzyme of paragraph 7, which begins at amino acid 286 of Figure 1 (SEQ ID NO: 2). 

20. The DNA polymerase enzyme of paragraph 7, wherein amino acids 1 to 370 of Figure 1 (SEQ ID NO: 2) are 
deleted. 

21. The DNA polymerase enzyme of paragraph 7, which begins at amino acid 371 of Figure 1 (SEQ ID NO: 2).

22. The DNA polymerase enzyme of paragraph 7, wherein amino acids 1 to 452 of Figure 1 (SEQ ID NO: 2) are 
deleted. 

23. The DNA polymerase enzyme of paragraph 7, which begins at amino acid 453 of Figure 1 (SEQ ID NO: 2). 

24. The DNA polymerase enzyme of paragraph 7, wherein amino acids 1 to 531 of Figure 1 (SEQ ID NO: 2) are 
deleted. 

25. The DNA polymerase enzyme of paragraph 7, which begins at amino acid 532 of Figure 1 (SEQ ID NO: 2). 

26. The DNA polymerase enzyme of paragraph 7, wherein amino acids 1 to 615 of Figure 1 (SEQ ID NO: 2) are 
deleted. 

27. The DNA polymerase enzyme of paragraph 7, which begins at amino acid 616 of Figure 1 (SEQ ID NO: 2). 

28. The DNA polymerase enzyme of any one of paragraphs 7, 18, 19, 20, 21, 22, 23, 24, 25, 26, or 27, wherein
the enzyme retains DNA synthesis activity but is deficient in 3' to 5' exonuclease activity.

29. A Bacillus stearothermophilus DNA polymerase I enzyme from expression of the isolated nucleic acid molecule 
of any one of paragraphs 6, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17 or 18. 

30. The Bacillus stearothermophilus DNA polymerase I enzyme of paragraphs 5 or 7 from expression of exogenous
DNA encoding the enzyme.

31. A method for producing a Bacillus stearothermophilus DNA polymerase I having a processive DNA synthesis
activity and a substantially reduced 3'-5' exonuclease activity comprising the steps of:

(1) isolating a DNA polymerase I enzyme from Bacillus stearothermophilus; and

(2) deleting a portion of the DNA polymerase I to create a truncated enzyme. 

32. The method of paragraph 31 further comprising: 

(3) transforming a vector to contain and express a DNA molecule encoding the truncated enzyme; and, 

(4) isolating the expressed DNA polymerase I enzyme from the transformed vector.

33. A method for producing a Bacillus stearothermophilus DNA polymerase I having a processive DNA synthesis 
activity and a substantially reduced 3'-5' exonuclease activity comprising the steps of: 

(1) isolating a DNA encoding DNA polymerase I enzyme from Bacillus stearothermophilus, 

(2) deleting from the 5' end of the DNA to create DNA encoding a truncated enzyme; 

(3) culturing a host cell containing a recombinant molecule comprising:

(a) DNA encoding the truncated enzyme and 

(b) a promoter wherein said promoter and said DNA encoding the truncated enzyme are in such position 
and orientation with respect to each other that said DNA may be expressed in a host cell under control of 
said promoter; and 

(4) isolating the truncated enzyme from the host cell. 

34. A vector containing the isolated nucleic acid molecule or DNA molecule as in any one of paragraphs 1, 2, 6, 
8, 9, 10, 11, 12, 13, 14, 15, 16 or 17. 

35. The vector of paragraph 34 which is a plasmid. 

36. In a method for sequencing DNA or labelling DNA wherein the improvement comprises using a Bacillus stearo- 
thermophilus DNA polymerase I enzyme. 

37. The method of paragraph 36 wherein the enzyme has DNA synthesis activity but has reduced 3' to 5' exonuclease
activity.

38. The method of paragraph 36 wherein the enzyme is from expression of an isolated DNA molecule comprising
a nucleotide sequence as in Figure 1 (SEQ ID NO: 1) having deletions from the 5' end of the molecule such that
the DNA molecule encodes an enzyme which retains DNA synthesis activity but is deficient in 3' to 5' exonuclease
activity.

39. The method of paragraph 36 wherein the enzyme is a Bacillus stearothermophilus DNA polymerase I enzyme
comprising the amino acid sequence as in Figure 1 (SEQ ID NO: 2) which is truncated such that the enzyme retains
DNA synthesis activity but exhibits reduced 3' to 5' exonuclease activity.

40. The method of paragraph 38 wherein in the isolated DNA molecule, nucleotides 1 to 1077 of Figure 1 (SEQ
ID NO: 1) are deleted.

41. The method of paragraph 38 wherein the isolated DNA molecule begins at nucleotide 1078 of Figure 1 (SEQ
ID NO: 1).

42. The method of paragraph 38 wherein in the isolated DNA molecule, nucleotides 1 to 1332 of Figure 1 (SEQ
ID NO: 1) are deleted.

43. The method of paragraph 38 wherein the isolated DNA molecule begins at nucleotide 1333 of Figure 1 (SEQ
ID NO: 1).

44. The method of paragraph 38 wherein in the isolated DNA molecule, nucleotides 1 to 1577 of Figure 1 (SEQ
ID NO: 1) are deleted.

45. The method of paragraph 38 wherein the isolated DNA molecule begins at nucleotide 1578 of Figure 1 (SEQ
ID NO: 1).

46. The method of paragraph 38 wherein in the isolated DNA molecule, nucleotides 1 to 1815 of Figure 1 (SEQ ID
NO: 1) are deleted.

47. The method of paragraph 38 wherein the isolated DNA molecule begins at nucleotide 1816 of Figure 1 (SEQ
ID NO: 1).

48. The method of paragraph 38 wherein in the isolated DNA molecule, nucleotides 1 to 2067 of Figure 1 (SEQ ID
NO: 1) are deleted.

49. The method of paragraph 38 wherein the isolated DNA molecule begins at nucleotide 2068 of Figure 1 (SEQ
ID NO: 1).

50. The method of paragraph 39 wherein in the DNA polymerase enzyme, amino acids 1 to 285 of Figure 1 (SEQ 
ID NO: 2) are deleted. 

51. The method of paragraph 39 wherein the DNA polymerase enzyme begins at amino acid 286 of Figure 1 (SEQ
ID NO: 2).

52. The method of paragraph 39 wherein in the DNA polymerase enzyme, amino acids 1 to 370 of Figure 1 (SEQ
ID NO: 2) are deleted.

53. The method of paragraph 39 wherein the DNA polymerase enzyme begins at amino acid 371 of Figure 1 (SEQ
ID NO: 2).

54. The method of paragraph 39 wherein in the DNA polymerase enzyme, amino acids 1 to 452 of Figure 1 (SEQ
ID NO: 2) are deleted.

55. The method of paragraph 39 wherein the DNA polymerase enzyme begins at amino acid 453 of Figure 1 (SEQ 
ID NO: 2). 

56. The method of paragraph 39 wherein in the DNA polymerase enzyme, amino acids 1 to 531 of Figure 1 (SEQ
ID NO: 2) are deleted.

57. The method of paragraph 39 wherein the DNA polymerase enzyme begins at amino acid 532 of Figure 1 (SEQ
ID NO: 2).

58. The method of paragraph 39 wherein in the DNA polymerase enzyme, amino acids 1 to 615 of Figure 1
(SEQ ID NO: 2) are deleted.

59. The method of paragraph 39 wherein the DNA polymerase enzyme begins at amino acid 616 of Figure 1 (SEQ
ID NO: 2).

60. The method of paragraph 36 which is automated.

61. The method of paragraph 36 which is for DNA sequencing.

62. The method of paragraph 36 which is for DNA labelling. 

Having thus described in detail preferred embodiments of the present invention, it is to be understood that the
invention defined by the appended claims is not to be limited to particular details set forth in the above description as
many apparent variations thereof are possible without departing from the spirit or scope of the present invention.

Claims

1. An isolated nucleic acid molecule encoding a Bacillus stearothermophilus DNA polymerase I enzyme having
reduced 3' to 5' exonuclease activity.

2. The isolated nucleic acid molecule of claim 1 comprising a Bacillus stearothermophilus DNA polymerase I gene
having deletions, inactivating insertions, or mutations at the 5' end of the molecule.

3. A Bacillus stearothermophilus DNA polymerase I enzyme encoded by the isolated DNA molecule of claim 1.

4. A Bacillus stearothermophilus DNA polymerase I which has DNA synthesis activity but is deficient in 3' to 5'
exonuclease activity.

5. An isolated nucleic acid molecule of claims 1 or 2 comprising a DNA molecule comprising a nucleotide sequence as
in Figure 1 (SEQ ID NO: 1) having deletions from the 5' end of the molecule such that the DNA molecule encodes
an enzyme which retains DNA synthesis activity but is deficient in 3' to 5' exonuclease activity.

6. A Bacillus stearothermophilus DNA polymerase I enzyme of claims 3 or 4 comprising the amino acid sequence
as in Figure 1 (SEQ ID NO: 2) which is truncated such that the enzyme retains DNA synthesis activity but exhibits
reduced 3' to 5' exonuclease activity.

7. The isolated DNA molecule of claim 5,

wherein nucleotides 1 to 1077 of Figure 1 (SEQ ID NO: 1) are deleted; or

which begins at nucleotide 1078 of Figure 1 (SEQ ID NO: 1); or

wherein nucleotides 1 to 1332 of Figure 1 (SEQ ID NO: 1) are deleted; or

which begins at nucleotide 1333 of Figure 1 (SEQ ID NO: 1); or

wherein nucleotides 1 to 1577 of Figure 1 (SEQ ID NO: 1) are deleted; or

which begins at nucleotide 1578 of Figure 1 (SEQ ID NO: 1); or

wherein nucleotides 1 to 1815 of Figure 1 (SEQ ID NO: 1) are deleted; or

which begins at nucleotide 1816 of Figure 1 (SEQ ID NO: 1); or

wherein nucleotides 1 to 2067 of Figure 1 (SEQ ID NO: 1) are deleted; or

which begins at nucleotide 2068 of Figure 1 (SEQ ID NO: 1).

8. The DNA polymerase enzyme of claim 6,

wherein amino acids 1 to 285 of Figure 1 (SEQ ID NO: 2) are deleted; or

which begins at amino acid 286 of Figure 1 (SEQ ID NO: 2); or

wherein amino acids 1 to 370 of Figure 1 (SEQ ID NO: 2) are deleted; or

which begins at amino acid 371 of Figure 1 (SEQ ID NO: 2); or

wherein amino acids 1 to 452 of Figure 1 (SEQ ID NO: 2) are deleted; or

which begins at amino acid 453 of Figure 1 (SEQ ID NO: 2); or

wherein amino acids 1 to 531 of Figure 1 (SEQ ID NO: 2) are deleted; or

which begins at amino acid 532 of Figure 1 (SEQ ID NO: 2); or

wherein amino acids 1 to 615 of Figure 1 (SEQ ID NO: 2) are deleted; or

which begins at amino acid 616 of Figure 1 (SEQ ID NO: 2).

9. A Bacillus stearothermophilus DNA polymerase I enzyme from expression of the isolated nucleic acid molecule 
of any one of claims 5 or 7. 

10. The Bacillus stearothermophilus DNA polymerase I enzyme of claims 6 or 8 from expression of exogenous DNA
encoding the enzyme.

11. A method for producing a Bacillus stearothermophilus DNA polymerase I having a processive DNA synthesis
activity and a substantially reduced 3'-5' exonuclease activity comprising the steps of:

(1) isolating a DNA polymerase I enzyme from Bacillus stearothermophilus; and

(2) deleting a portion of the DNA polymerase I to create a truncated enzyme; and optionally

further comprising:

(3) transforming a vector to contain and express a DNA molecule encoding the truncated enzyme; and,

(4) isolating the expressed DNA polymerase I enzyme from the transformed vector.

12. A method for producing a Bacillus stearothermophilus DNA polymerase I having a processive DNA synthesis 
activity and a substantially reduced 3'-5' exonuclease activity comprising the steps of: 

(1) isolating a DNA encoding DNA polymerase I enzyme from Bacillus stearothermophilus,

(2) deleting from the 5' end of the DNA to create DNA encoding a truncated enzyme;

(3) culturing a host cell containing a recombinant molecule comprising:

(a) DNA encoding the truncated enzyme and

(b) a promoter wherein said promoter and said DNA encoding the truncated enzyme are in such position
and orientation with respect to each other that said DNA may be expressed in a host cell under control of
said promoter; and

(4) isolating the truncated enzyme from the host cell.

13. A vector containing the isolated nucleic acid molecule or DNA molecule as claimed in any one of claims 1, 2, 5 or 7.

14. The vector of claim 13 which is a plasmid.

15. Use in sequencing DNA or labelling DNA of a Bacillus stearothermophilus DNA polymerase enzyme.

16. The use of claim 15 wherein the enzyme has DNA synthesis activity but has reduced 3' to 5' exonuclease activity.

17. The use of claim 16 wherein the enzyme is from expression of a nucleic acid molecule or DNA molecule as claimed
in any one of claims 1, 2, 5 or 7, or wherein the enzyme is as claimed in any one of claims 3, 4, 6, 8, 9 or 10, or
wherein the enzyme is from a method as claimed in any one of claims 11 or 12, or wherein the enzyme is from
expression of a vector as claimed in any one of claims 13 or 14.

18. The method of claim 17 which is automated.

19. The method of claim 18 which is for DNA sequencing.

20. The method of claim 18 which is for DNA labelling.